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10 BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention relates to polypeptide -polymer conjugates 
having added and/or removed one or more attachment groups for coupling 
polymeric molecules on the surface of the 3D structure of the 
15 polypeptide, a method for preparing polypeptide-polymer conjugates of 
the invention, the use of said conjugated for reducing the 
immunogenicity and allergenicity , and compositions comprising said 
conjugate . 

20 Description of Related Art 

The use of polypeptides, including enzymes, in the circulatory 
system to obtain a particular physiological effect is well-known in the 
medical arts. Further, within the arts of industrial applications, such 
as laundry washing, textile bleaching, person care, contact lens 

25 cleaning, food and feed preparation enzymes are used as a functional 
ingredient. One of the important differences between pharmaceutical and 
industrial application is that for the latter type of applications (i.e. 
industrial applications) the polypeptides (often enzymes) are not 
intended to enter into the circulatory system of the body. 

3 0 Certain polypeptides and enzymes have an unsatisfactory stability 

and may under certain circumstances - dependent on the way of challenge 
- cause an immune response, typically an IgG and/ or IgE response. 

It is today generally recognized that the stability of 
polypeptides is improved and the immune response is reduced when 

3 5 polypeptides, such as enzymes, are coupled to polymeric molecules. It is 
believed that the reduced immune response is a result of the shielding 
of (the) epitope (s) on the surface of the polypeptide responsible for 
the immune response leading to antibody formation by the coupled 
polymeric molecules . 
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Techniques for conjugating polymeric molecules to polypeptides are 
well-known in the art. 

One of the first commercially suitable techniques was described 
back in the early 1970 's and disclosed in e.g. US Patent No. 4,179,337. 
5 Said patent concerns non- immunogenic polypeptides, such as enzymes and 
peptide hormones coupled to polyethylene glycol (PEG) or polypropylene 
glycol (PPG) . At least 15% of polypeptides' physiological activity is 
maintained. 

GB patent no. 1,183,257 (Crook et al . ) describes chemistry for 
10 conjugation of enzymes to polysaccharides via a triazine ring. 

Further, techniques for maintaining of the enzymatic activity of 
enzyme -polymer conjugates are also known in the art. 

WO 93/15189 (Veronese et al . ) concerns a method for maintaining 
the activity in polyethylene glycol -modified proteolytic enzymes by 
15 linking the proteolytic enzyme to a macromolecularized inhibitor. The 
conjugates are intended for medical applications. 

It has been found that the attachment of polymeric molecules to a 
polypeptide often has the effect of reducing the activity of the 
polypeptide by interfering with the interaction between the polypeptide 
20 and its substrate. EP 183 503 (Beecham Group PLC) discloses a 
development of the above concept by providing conjugates comprising 
pharmaceutical ly useful proteins linked to at least one water-soluble 
polymer by means of a reversible linking group. 

EP 471,125 (Kanebo) discloses skin care products comprising a 
25 parent protease {Bacillus protease with the trade name Esperase®) 
coupled to polysaccharides through a triazine ring to improve the 
thermal and preservation stability. The coupling technique used is also 
described in the above mentioned GB patent no. 1,183,257 (Crook et al . ) . 
JP 3083908 describes a skin cosmetic material which contains a 
3 0 transglutaminase from guinea pig liver modified with one or more 
water-soluble substances such as PEG, starch, cellulose etc. The 
modification is performed by activating the polymeric molecules and 
coupling them to the enzyme. The composition is stated to be mild to 
the skin. 

3 5 However, it is not always possible to readily couple polymeric 

molecules to polypeptides and enzymes. Further, there is still a need 
for polypeptide-polymer conjugates with an even more reduced 
immunogenicity and/or allergenicity . 
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SUMMARY OF THE INVENTION 

It is the object of the present invention to provide improved 
polypeptide -polymer conjugates suitable for industrial and 
pharmaceutical applications. 
5 The term "improved polypeptide -polymer conjugates" means in the 

context of the present invention conjugates having a reduced immune 
response in humans and animals and/or an improved stability. As will be 
described further below the immune response is dependent on the way of 
challenge . 

10 The present inventors have found that polypeptides, such as 

enzymes, may be made less immunogenic and/or allergenic by adding and/or 
removing one or more attachment groups on the surface of the parent 
polypeptide to be coupled to polymeric molecules. 

When introducing pharmaceutical polypeptide directly into the 

15 circulatory system (i.e. bloodstream) the potential risk is an 
immunogenic response in the form of mainly IgG, IgA and/or IgM 
antibodies. In contrast hereto, industrial polypeptides, such as enzymes 
used as a functional ingredient in e.g. detergents, are not intended to 
enter the circulatory system. The potential risk in connection with 

20 industrial polypeptides is inhalation causing an allergenic response in 
the form of mainly IgE antibody formation. 

Therefore, in connection with industrial polypeptides the 
potential risk is respiratory allergenicity caused by inhalation, 
intratracheal and intranasal presentation of polypeptides . 

25 The main potential risk of pharmaceutical polypeptides is 

immunogenicity caused by intradermal, intravenous or subcutaneous 
presentation of the polypeptide. 

It is to be understood that reducing the * immunogenicity" and 
reducing the "respiratory allergenicity" are two very different 

3 0 problems based on different routes of exposure and on two very 
different immunological mechanisms: 

The term " immunogenicity" used in connection with the present 
invention may be referred to as allergic contact dermatitis in a 
clinical setting and is a cell mediated delayed immune response to 

35 chemicals that contact and penetrate the skin. This cell mediated 
reaction is also termed delayed contact hypersensitivity (type IV reac- 
tion according to Gell and Combs classification of immune mechanisms in 
tissue damage) . 



3 



The term "allergenicity" or "respiratory allergenicity" is an 
immediate anaphylactic reaction (type I antibody-mediated reaction 
according to Gell and Combs) following inhalation of e.gr. polypeptides. 

According to the present invention it is possible to provide 
5 polypeptides with a reduced immune response and/or improved stability, 
which has a substantially retained residual activity. 

The allergic and the immunogenic response are in one term, at 
least in the context of the present invention called the "immune 
response" . 

10 In the first aspect the invention relates to a polypeptide -polymer 

conjugate having 

a) one or more additional polymeric molecules coupled to the 
polypeptide having been modified in a manner to increase the number of 
attachment groups on the surface of the polypeptide in comparison to the 

15 number of attachment groups available on the corresponding parent 
polypeptide , and/or 

b) one or more fewer polymeric molecules coupled to the 
polypeptide having been modified in a manner to decrease the number of 
attachment groups at or close to the functional site(s) of the 

2 0 polypeptide in comparison to the number of attachment groups available 
on the corresponding parent polypeptide. 

The term "parent polypeptide" refers to the polypeptide to be 
modified by coupling to polymeric molecules. The parent polypeptide may 
be a naturally-occurring (or wild-type) polypeptide or may be a variant 

2 5 thereof prepared by any suitable means. For instance, the parent 

polypeptide may be a variant of a naturally-occurring polypeptide which 
has been modified by substitution, deletion or truncation of one or more 
amino acid residues or by addition or insertion of one or more amino 
acid residues to the amino acid sequence of a naturally-occurring 

3 0 polypeptide. 

A "suitable attachment group" means in the context of the present 
invention any amino acid residue group on the surface of the polypeptide 
capable of coupling to the polymeric molecule in question. 

Preferred attachment groups are amino groups of lysine residues 
3 5 and the N- terminal amino group. Polymeric molecules may also be coupled 
to the carboxylic acid groups (-COOH) of amino acid residues in the 
polypeptide chain located on the surface. Carboxylic acid attachment 
groups may be the carboxylic acid group of aspartate or glutamate and 
the C- terminal COOH-group. 
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A "functional site" means any amino acid residues and/ or cof actors 
which are known to be essential for the performance of the polypeptide, 
such as catalytic activity, e.g. the catalytic triad residues, 
histidine, aspartate and serine in serine proteases, or e.g. the heme 
5 group and the distal and proximal histidines in a peroxidase such as the 
Arthromyces ramosus peroxidase . 

In the second aspect the invention relates to a method for 
preparing improved polypeptide -polymer conjugates comprising the steps 
of: 

10 a) identifying amino acid residues located on the surface of the 

3D structure of the parent polypeptide in question, 

b) selecting target amino acid residues on the surface of said 
3D structure of said parent polypeptide to be mutated, 

c) i) substituting or inserting one or more amino acid residues 
15 selected in step b) with an amino acid residue having a suitable 

attachment group, and/ or ii) substituting or deleting one or more amino 
acid residues selected in step b) at or close to the functional site(s) , 

d) coupling polymeric molecules to the mutated polypeptide. 

The invention also relates to the use of a conjugate of the 

2 0 invention and the method of the invention for reducing the 

immunogenicity of pharmaceuticals and reducing the allergenicity of 
industrial products . 

Finally the invention relates to compositions comprising a 
conjugate of the invention and further ingredients used in industrial 
25 products or pharmaceuticals. 

BRIEF DESCRIPTION OF THE DRAWING 

Figure 1 shows the anti-lipase serum antibody levels after 5 
weekly immunizations with i) control ii) unmodified lipase variant, iii) 
30 lipase variant-SPEG. (X; log(serum dilution); Y Optical Density 
(490/620) ) . 

DETAILED DESCRIPTION OF THE INVENTION 

It is the object of the present invention to provide improved 

3 5 polypeptide-polymer conjugates suitable for industrial and 

pharmaceutical applications. 

Even though polypeptides used for pharmaceutical applications and 
industrial application can be quite different the principle of the 
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present invention may be tailored to the specific type of parent 
polypeptide (i.e. enzyme, hormone peptides etc.). 

The inventors of the present invention have provided improved 
polypeptide-polymer conjugates with a reduced immune response in 
5 comparison to conjugates prepared from the corresponding parent 
polypeptides. 

The present inventors have found that polypeptides, such as 
enzymes, may be made less immunogenic and/or less allergenic by adding 
one or more attachment groups on the surface of the parent polypeptide. 

10 In addition thereto the inventors have found that a higher percentage of 
maintained residual functional activity may be obtained by removing 
attachment groups at or close to the functional site(s). 

In the first aspect the invention relates to an improved 
polypeptide-polymer conjugate having 

15 a) one or more additional polymeric molecules coupled to the 

polypeptide having been modified in a manner to increase the number of 
attachment groups on the surface of the polypeptide in comparison to the 
number of attachment groups available on the corresponding parent 
polypeptide , and/or 

2 0 b) one or more fewer polymeric molecules coupled to the 

polypeptide having been modified in a manner to decrease the number of 
attachment groups at or close to the functional site(s) of the 
polypeptide in comparison to the number of attachment groups available 
on the corresponding parent polypeptide. 

2 5 Whether the attachment groups should be added and/ or removed 

depends on the specific parent polypeptide. 

a) Addition of Attachment groups 

There may be a need for further attachment groups on the 

3 0 polypeptide if only few attachment groups are available on the surface 

of the parent polypeptide. The addition of one or more attachment groups 
by substituting or inserting one or more amino acid residues on the 
surface of the parent polypeptide increases the number of polymeric 
molecules which may be attached in comparison to the corresponding 
35 parent polypeptide. Conjugates with an increased number of polymeric 
molecules attached thereto are generally seen to have a reduced immune 
response in comparison to the corresponding conjugates having fewer 
polymeric molecules coupled thereto. 
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Any available amino acid residues on the surface of the 
polypeptide, preferentially not being at or close to the functional 
site(s), such as the active site(s) of enzymes, may in principle be 
subject to substitution and/or insertion to provide additional 
5 attachment groups. 

As will be described further below the location of the additional 
coupled polymeric molecules may be of importance for the reduction of 
the immune response and the percentage of maintained residual functional 
activity of the polypeptide itself. 
10 A conjugate of the invention may typically have from 1 to 25, 

preferentially 1 to 10 or more additional polymeric molecules coupled to 
the surface of the polypeptide in comparison to the number of polymeric 
molecules of a conjugate prepared on the basis of the corresponding 
parent polypeptide. 

15 However, the optimal number of attachment groups to be added 

depends (at least partly) on the surface area (i.e. molecular weight) of 
the parent polypeptide to be shielded by the coupled polymeric 
molecules, and also on the number of already available attachment groups 
on the parent polypeptide. 

20 

b) Removing Attachment groups 

In the case of enzymes or other polypeptides performing their 
function by interaction with a substrate or the like, polymeric 
molecules coupled to the polypeptide might be impeded by the interaction 

2 5 between the polypeptide and its substrate or the like, if they are 

coupled at or close to the functional site(s) (i.e. active site of 
enzymes) . This will most probably cause reduced activity. 

In the case of enzymes having one or more polymeric molecules 
coupled at or close to the active site a substantial loss of residual 

3 0 enzymatic activity can be expected. Therefore, according to the 

invention conjugates may be constructed to maintain a higher percentage 
of residual enzymatic activity in comparison to a corresponding 
conjugates prepared on the basis of the parent enzyme in question. This 
may be done by substituting and/or deleting attachment groups at or 
3 5 close to the active site, hereby increasing the substrate affinity by 
improving the accessibility of the substrate in the catalytic cleft. 

An enzyme -polymer conjugate of the invention may typically have 
from 1 to 25, preferably 1 to 10 fewer polymeric molecules coupled at or 
close to the active site in comparison to the number of polymeric 
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molecules of a conjugate prepared on the basis of the corresponding 
parent polypeptide. 

As will be explained below "at or close to" the functional site(s) 
means that no polymeric molecule (s) should be coupled within 5 
5 Angstroms, preferably 8 Angstroms, especially 10 Angstroms of the 
functional site(s). 

Removal of attachment groups at or close to the functional site(s) 
of the polypeptide may advantageously be combined with addition of 
attachment groups in other parts of the surface of the polypeptide. 
10 The total number of attachment groups may this way be unchanged, 

increased or decreased. However the location (s) of the total number of 
attachment group (s) is (are) improved assessed by the reduction of the 
immune response and/ or percentage of maintained residual activity. 
Improved stability may also be obtained this way. 

15 

The number of attachment groups 

Generally seen the number of attachment groups should be balanced 
to the molecular weight and/or surface area of the polypeptide. The more 
heavy the polypeptide is the more polymeric molecules should be coupled 
20 to the polypeptide to obtain sufficient shielding of the epitope (s) 
responsible for antibody formation. 

Therefore, if the parent polypeptide molecule is relatively light 
(e.g. 1 to 35 kDa) it may be advantageous to increase the total number 
of coupled polymeric molecules (outside the functional site(s)) to a 
25 total between 4 and 20. 

If the parent polypeptide molecules are heavier, for instance 35 
to 60 kDa, the number of coupled polymeric molecules (outside the 
functional site(s)) may advantageously be increased to 7 to 40, and so 
on. 

3 0 The ratio between the molecular weight (Mw) of the polypeptide in 

question and the number of coupled polymeric molecules considered to be 
suitable by the inventors is listed below in Table 1. 
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Table 1 



Molecular weight of parent 
polypeptide (M w ) kDa 


Number of polymeric molecules 
coupled to the polypeptide 


1 to 35 


4-20 


35 to 60 


7-40 


60 to 80 


10-50 


80 to 100 


15-70 


More than 100 


more than 20 



Reduced immune response vs. maintained residual enzymatic activity 

Especially for enzymes, in comparison to many other types of 
5 polypeptides, there is a conflict between reducing the immune response 
and maintaining a substantial residual enzymatic activity as the 
activity of enzymes are connected with interaction between a substrate 
and the active site often present as a cleft in the enzyme structure. 

Without being limited to any theory it is believed that the loss 
10 of enzymatic activity of enzyme -polymer conjugates might be a 
consequence of impeded access of the substrate to the active site in the 
form of spatial hindrance of the substrate by especially bulky and/or 
heavy polymeric molecules to the catalytic cleft. It might also, at 
least partly, be caused by disadvantageous minor structural changes of 
15 the 3D structure of the enzyme due to the stress made by the coupling of 
the polymeric molecules. 

Maintained residual activity 

A polypeptide-polymer conjugates of the invention has a 
2 0 substantially maintained functional activity. 

A "substantially" maintained functional activity is in the context 
of the present invention defined as an activity which is at least 
between 20% and 3 0%, preferably between 3 0% and 40%, more preferably 
between 40% and 60%, better from 60% up to 80%, even better from 80% up 
25 to about 100%, in comparison to the activity of the conjugates prepared 
on the basis of corresponding parent polypeptides. 

In the case of polypeptide-polymer conjugates of the invention 
where no polymeric molecules are coupled at or close to the functional 
site(s) the residual activity may even be up to 100% or very close 
30 thereto. If attachment group (s) of the parent polypeptide is (are) 
removed from the functional site the activity might even be more than 
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100% in comparison to modified (i.e. polymer coupled) parent polypeptide 
conjugate . 

Position of coupled polymeric molecules 
5 To obtain an optimally reduced immune response (i.e. immunogenic 

and allergenic response) the polymeric molecules coupled to the surface 
of the polypeptide in question should be located in a suitable distance 
from each other. 

In a preferred embodiment of the invention the parent polypeptide 
10 is modified in a manner whereby the polymeric molecules are spread 
broadly over the surface of the polypeptide. In the case of the 
polypeptide in question has enzymatic activity it is preferred to have 
as few as possible, especially none, polymeric molecules coupled at or 
close to the area of the active site. 
15 In the present context "spread broadly over the surface of the 

polypeptide" means that the available attachment groups are located so 
that the polymeric molecules shield different parts of the surface, 
preferably the whole or close to the whole surface area away from the 
functional site(s), to make sure that epitope (s) are shielded and hereby 

2 0 not recognized by the immune system or its antibodies. 

The area of antibody-polypeptide interaction typically covers an 
area of 500 Angstroms 2 , as described by Sheriff et al . , 1987, Proc. 
Natl. Acad. Sci. USA, 84, 8075-8079. 500 Angstroms 2 corresponds to a 
rectangular box of 2 5 Angstroms x 20 Angstroms or a circular region of 
25 radius 12.6 Angstroms. Therefore, to prevent binding of antibodies to 
the epitope (s) to the polypeptide in question it is preferred to have 
a maximum distance between two attachment groups around 10 Angstroms. 

Consequently, amino acid residues which are located in excess of 
10 Angstroms away from already available attachment groups are 

3 0 suitable target residues. If two or more attachment groups on the 

polypeptide are located very close to each other it will in most cases 
result in that only one polymeric molecule will be coupled. To ensure a 
minimal loss of functional activity it is preferred not to couple 
polymeric molecules at or close to the functional site(s). Said distance 
3 5 depends at least partly on the bulkiness of the polymeric molecules to 
be coupled, as impeded access by the bulky polymeric molecules to the 
functional site is undesired. Therefore, the more bulky the polymeric 
molecules are the longer should the distance from the functional site to 
the coupled polymeric molecules be. 
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To maintain a substantial functional activity of the polypeptide 
in question attachment groups located within 5 Angstroms, preferred 8 
Angstroms, especially 10 Angstroms from such functional site(s) should 
be left uncoupled and may therefore advantageously be removed or 
5 changed by mutation. Functional residues should normally not be 
mutated/removed, even though they potentially can be the target for 
coupling polymeric molecules. In said case it may thus be advantageous 
to choose a coupling chemistry involving different attachment groups. 

Further, to provide a polypeptide having coupled polymeric 

10 molecules at (a) known epitope (s) recognizable by the immune system or 
close to said epitope (s) specific mutations at such sites are also 
considered advantageous according to the invention. If the position of 
the epitope (s) is (are) unknown it is advantageous to couple several or 
many polymeric molecules to the polypeptide. 

15 As also mentioned above it is preferred that said attachment 

groups are spread broadly over the surface. 

The attachment group 

Virtually all ionized groups, such as the amino groups of lysine 
2 0 residues, are located on the surface of the polypeptide molecule (see 
for instance Thomas E. Creighton, 1993, "Proteins", W.H. Freeman and 
Company, New York) . 

Therefore, the number of readily accessible attachment groups 
(e.g. amino groups) on a modified or parent polypeptide equals generally 

2 5 seen the number of lysine residues in the primary structure of the 

polypeptide plus the N-terminus amino group. 

The chemistry of coupling polymeric molecules to amino groups are 
quite simple and well established in the art. Therefore, it is preferred 
to add and/or remove lysine residues (i.e. attachment groups) to/from 

3 0 the parent polypeptide in question to obtain improved conjugates with 

reduced immunogenicity and/or allergenicity and/or improved stability 
and/or high percentage maintained functional activity. 

Polymeric molecules may also be coupled to the carboxylic groups 
(-COOH) of amino acid residues on the surface of the polypeptide. 
35 Therefore, if using carboxylic groups (including the C- terminal group) 
as attachment groups addition and/or removal of aspartate and glutamate 
residues may also be suitable according to the invention. 

If using other attachment groups, such as -SH groups, they may be 
added and/or removed analogously. 
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Substitution of the amino acid residues is preferred over 
insertion, as the impact on the 3D structure of the polypeptide normally 
will be less pronounced. 

Preferred substitutions are conservative substitutions. In the 
5 case of increasing the number of attachment groups the substitution may 
advantageously be performed at a location having a distance of 5 
Angstroms, preferred 8 Angstroms, especially 10 Angstroms from the 
functional site(s) (active site for enzymes). 

An example of a suitable conservative substitution to obtain an 
10 additional amino attachment group is an arginine to lysine substitution. 
Examples of conservative substitutions to obtain additional carboxylic 
attachment groups are aspargine to aspartate/glutamate or glutamine to 
aspartate/glutamate substitutions. To remove attachment groups a lysine 
residue may be substituted with an arginine and so on. 

15 

The parent polypeptide 

In the context of the present invention the term "polypeptides" 
includes proteins, peptides and/ or enzymes for pharmaceutical or 
industrial applications. Typically the polypeptides in question have a 

2 0 molecular weight in the range between about 1 to 100 kDa, often 15 kDa 

and 100 kDa. 

Pharmaceutical polypeptides 

The term "pharmaceutical polypeptides" is defined as polypeptides, 
25 including peptides, such as peptide hormones, proteins and/or enzymes, 
being physiologically active when introduced into the circulatory system 
of the body of humans and/or animals. 

Pharmaceutical polypeptides are potentially immunogenic as they 
are introduced into the circulatory system. 

3 0 Examples of ''pharmaceutical polypeptides" contemplated according 

to the invention include insulin, ACTH, glucagon, somatostatin, 
somatotropin, thymosin, parathyroid hormone, pigmentary hormones, 
somatomedin, erythropoietin, luteinizing hormone, chorionic 
gonadotropin, hypothalmic releasing factors, antidiuretic hormones, 
35 thyroid stimulating hormone, relaxin, interferon, thrombopoietin (TPO) 
and prolactin. 
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Industrial polypeptides 

Polypeptides used for industrial applications often have an 
enzymatic activity. Industrial polypeptides (e.g. enzymes) are (in 
contrast to pharmaceutical polypeptides) not intended to be introduced 
5 into the circulatory system of the body. 

It is not very like that industrial polypeptides, such as enzymes 
used as ingredients in industrial compositions and/or products, such as 
detergents and personal care products, including cosmetics, come into 
direct contact with the circulatory system of the body of humans or 
10 animals, as such enzymes (or products comprising such enzymes) are not 
injected (or the like) into the bloodstream. 

Therefore, in the case of the industrial polypeptide the potential 
risk is respiratory allergy (i.e. IgE response) as a consequence of 
inhalation to polypeptides through the respiratory passage. 
15 In the context of the present invention "industrial polypeptides" 

are defined as polypeptides, including peptides, proteins and/or 
enzymes, which are not intended to be introduced into the circulatory 
system of the body of humans and/or animals . 

Examples of such polypeptides are polypeptides, especially 
20 enzymes, used in products such as detergents, household article prod- 
ucts, agrochemicals, personal care products, such as skin care products, 
including cosmetics and toiletries, oral and dermal pharmaceuticals, 
composition use for processing textiles, compositions for hard surface 
cleaning, and compositions used for manufacturing food and feed etc. 

25 

Enzymatic activity 

Pharmaceutical or industrial polypeptides exhibiting enzymatic 
activity will often belong to one of the following groups of enzymes 
including Oxidoreductases (E.C. 1, "Enzyme Nomenclature, (1992), 
30 Academic Press, Inc.), such as laccase and Superoxide dismutase (SOD); 
Transferases, (E.C. 2), such as transglutaminases (TGases) ; Hydrolases 
(E.C. 3), including proteases, especially subtilisins, and lipolytic 
enzymes; Isomerases (E.C. 5), such as Protein disulfide Isomerases 
(PDI) . 

35 

Hydrolases 

Proteolytic enzymes 

Contemplated proteolytic enzymes include proteases selected from 
the group of Aspartic proteases, such pepsins, cysteine proteases, such 
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as papain, serine proteases, such as subtilisins, or metallo proteases, 
such as NEUTRASE®. 

Specific examples of parent proteases include PD498 (WO 93/24623 
and SEQ ID NO: 2), SAVINASE® (von der Osten et al . , 1993, Journal of 
5 Biotechnology, 28, 55+, SEQ ID NO: 3), Proteinase K (Gunkel et al . , 
1989, Eur. J. Biochem, 179, 185-194), Proteinase R (Samal et al, 1990, 
Mol. Microbiol, 4, 1789-1792), Proteinase T (Samal et al . , 1989, Gene, 
85, p. 329-333), Subtilisin DY (Betzel et al . 1993, Arch. Biophys, 
302(2), 499-502), Lion Y (JP 04197182-A) , RENNILASE® (Available from 
10 Novo Nordisk A/S) , JA16 (WO 92/17576), ALCALASE® (a natural subtilisin 
Carlberg variant) (von der Osten et al . , 1993, Journal of Biotechnology, 
28, 55+) . 

Lipolytic enzymes 

15 Contemplated lipolytic enzymes include Humicola lanuginosa 

lipases, e.g. the one described in EP 258 068 and EP 305 216 (See SEQ ID 
NO: 6 below), Humicola insolens, a Rhizomucor miehei lipase, e.g. as 
described in EP 238 023, Absidia sp. lipolytic enzymes (WO 96/13578), a 
Candida lipase, such as a C. antarctica lipase, e.g. the C. antarctica 

2 0 lipase A or B described in EP 214 761, a Pseudomonas lipase such as a P. 
alcaligenes and P. pseudoalcaligenes lipase, e.g. as described in EP 218 
272, a P. cepacia lipase, e.g. as described in EP 331 376, a Pseudomonas 
sp. lipase as disclosed in WO 95/14783, a Bacillus lipase, e.g. a B. 
subtilis lipase (Dartois et al . , 1993 Biochemica et Biophysica Acta 

25 1131, 253-260), a B. stearothermophilus lipase (JP 64/744992) and a B. 
pumilus lipase (WO 91/16422) . Other types of lipolytic enzymes include 
cutinases, e.g. derived from Pseudomonas mendocina as described in WO 
88/09367, or a cutinase derived from Fusarium solani pisi {e.g. 
described in WO 90/09446) . 

30 

Oxidoreductases 

Laccases 

Contemplated laccases include Polyporus pinisitus laccase (WO 
96/00290) , Myceliophthora laccase (WO 95/33836) , Scytalidium laccase (WO 
35 95/338337), and Pyricularia oryzae laccase (Available from Sigma,). 

Peroxidase 

Contemplated peroxidases include B. pumilus peroxidases (WO 
91/05858), Myxococcaceae peroxidase (WO 95/11964), Coprinus cinereus (WO 
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95/10602} and Arthromyces ramosus peroxidase (Kunishima et al . 1994, J. 
Mol. Biol., 235, 331-344). 

Transferases 

5 Transglutaminases 

Suitable transferases include any transglutaminases disclosed in 
WO 96/06931 (Novo Nordisk A/S) and WO 96/22366 (Novo Nordisk A/S) . 

Isomer as es 

10 Protein Disulfide Isomerase 

Without being limited thereto suitable protein disulfide 
isomerases include PDIs described in WO 95/01425 (Novo Nordisk A/S) . 

The polymeric molecule 
15 The polymeric molecules coupled to the polypeptide may be any 

suitable polymeric molecule, including natural and synthetic homo-poly- 
mers, such as polyols (i.e. poly-OH) , polyamines (i.e. poly-NH 2 ) and 
polycarboxyl acids (i.e. poly-COOH) , and further hetero-polymers i.e. 
polymers comprising one or more different coupling groups e.g. a 

2 0 hydroxyl group and amine groups. 

Examples of suitable polymeric molecules include polymeric 
molecules selected from the group comprising polyalkylene oxides (PAO) , 
such as polyalkylene glycols (PAG) , including polyethylene glycols 
(PEG) , methoxypolyethylene glycols (mPEG) and polypropylene glycols, 
25 PEG-glycidyl ethers (Epox-PEG) , PEG-oxycarbonylimidazole (CDI-PEG) , 
branched PEGs, polyvinyl alcohol (PVA) , polycarboxylates, 
polyvinylpyrrolidones , poly-D, L-amino acids , polyethylene-co-maleic acid 
anhydride, polystyrene-co-malic acid anhydride, dextrans including 
carboxymethyl-dextrans, heparin, homologous albumin, celluloses, 

3 0 including methylcellulose, carboxymethylcellulose, ethylcellulose, 

hydroxyethylcellulose carboxyethylcellulose and hydroxypropylcellulose, 
hydrolysates of chitosan, starches such as hydroxyethyl -starches and 
hydroxy propyl -starches, glycogen, agaroses and derivatives thereof, 
guar gum, pullulan, inulin, xanthan gum, carrageenin, pectin, alginic 
35 acid hydrolysates and bio-polymers. 

Preferred polymeric molecules are non-toxic polymeric molecules 
such as (m) polyethylene glycol ( (m) PEG) which further requires a 
relatively simple chemistry for its covalently coupling to attachment 
groups on the enzyme's surface. 
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Generally seen polyalkylene oxides (PAO) , such as polyethylene 
oxides, such as PEG and especially mPEG, are the preferred polymeric 
molecules , as these polymeric molecules, in comparison to 
polysaccharides such as dextran, pullulan and the like, have few 
5 reactive groups capable of cross-linking. 

Even though all of the above mentioned polymeric molecules may be 
used according to the invention the methoxypolyethylene glycols (mPEG) 
may advantageously be used. This arises from the fact that 
me thoxy ethylene glycols have only one reactive end capable of conju- 
10 gating with the enzyme. Consequently, the risk of cross-linking is less 
pronounced. Further, it makes the product more homogeneous and the 
reaction of the polymeric molecules with the enzyme easier to control. 

Preparation of enzyme variants 

15 Enzyme variants to be conjugated may be constructed by any 

suitable method. A number of methods are well established in the art. 
For instance enzyme variants according to the invention may be 
generated using the same materials and methods described in e.g. WO 
89/06279 (Novo Nordisk A/S) , EP 130,756 (Genentech) , EP 479,870 (Novo 

20 Nordisk A/S), EP 214,435 (Henkel) , WO 87/04461 (Amgen) , WO 87/05050 
(Genex) , EP application no. 87303761 (Genentech), EP 260,105 
(Genencor) , WO 88/06624 (Gist-Brocades NV) , WO 88/07578 (Genentech), 
WO 88/08028 (Genex) , WO 88/08033 (Amgen) , WO 88/08164 (Genex) , Thomas 
et al., 1985, Nature, 318, 375-376; Thomas et al . , 1987, J. Mol . 

25 Biol., 193, 803-813; Russel and Fersht, 1987, Nature, 328, 496-500. 

Generation of site directed mutations 

Prior to mutagenesis the gene encoding the polypeptide of interest 
must be cloned in a suitable vector. Methods for generating mutations in 

3 0 specific sites are described below. 

Once the polypeptide encoding gene has been cloned, and desirable 
sites for mutation identified and the residue to substitute for the 
original ones have been decided, these mutations can be introduced using 
synthetic oligonucleotides. These oligonucleotides contain nucleotide 

35 sequences flanking the desired mutation sites; mutant nucleotides are 
inserted during oligo-nucleotide synthesis. In a preferred method, Site- 
directed mutagenesis is carried out by SOE-PCR mutagenesis technique 
described by Kammann et al . , 1989, Nucleic Acids Research, 17(13), 5404, 
and by Sarkar G. and Sommer, S.S., 1990, Biotechniques , 8, 404-407. 
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Activation of polymers 

If the polymeric molecules to be conjugated with the polypeptide 
in question are not active it must be activated by the use of a suitable 
5 technique. It is also contemplated according to the invention to couple 
the polymeric molecules to the polypeptide through a linker. Suitable 
linkers are well-known to the skilled person. 

Methods and chemistry for activation of polymeric molecules as 
well as for conjugation of polypeptides are intensively described in the 

10 literature. Commonly used methods for activation of insoluble polymers 
include activation of functional groups with cyanogen bromide, 
periodate, glutaraldehyde, biepoxides , epichlorohydrin, divinylsulf one, 
carbodiimide, sulfonyl halides, trichlorotriazine etc. (see R.F. Taylor, 
(1991), "Protein Immobilisation. Fundamentals and Applications", Marcel 

15 Dekker, N.Y.; S.S. Wong, 1992, "Chemistry of Protein Conjugation and 
Crosslinking" , CRC Press, Boca Raton; G.T. Hermanson et al . , 1993, 
"Immobilized Affinity Ligand Techniques", Academic Press, N.Y.). Some of 
the methods concern activation of insoluble polymers but are also 
applicable to activation of soluble polymers e.g. periodate, 

2 0 trichlorotriazine, sulf onylhalides, divinylsulf one, carbodiimide etc. 

The functional groups being amino, hydroxyl, thiol, carboxyl, aldehyde 
or sulfydryl on the polymer and the chosen attachment group on the 
protein must be considered in choosing the activation and conjugation 
chemistry which normally consist of i) activation of polymer, ii) 
25 conjugation, and iii) blocking of residual active groups. 

In the following a number of suitable polymer activation methods 
will be described shortly. However, it is to be understood that also 
other methods may be used. 

Coupling polymeric molecules to the free acid groups of poly- 

3 0 peptides may be performed with the aid of diimide and for example amino- 

PEG or hydrazino-PEG (Pollak et al . , 1976, J. Amr. Chem. Soc . , 98, 
289-291) or diazoacetate/amide (Wong et al . , 1992, "Chemistry of Protein 
Conjugation and Crosslinking", CRC Press) . 

Coupling polymeric molecules to hydroxy groups are generally very 
3 5 difficult as it must be performed in water. Usually hydrolysis predomi- 
nates over reaction with hydroxyl groups. 

Coupling polymeric molecules to free sulfhydryl groups can be 
reached with special groups like maleimido or the ortho-pyridyl 
disulfide. Also vinylsulfone (U.S. Patent No. 5,414,135, (1995), Snow et 
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al.) has a preference for sulfhydryl groups but is not as selective as 
the other mentioned. 

Accessible arginine residues in the polypeptide chain may be 
targeted by groups comprising two vicinal carbonyl groups. 
5 Techniques involving coupling electrophilically activated PEGs to 

the amino groups of lysines may also be useful . Many of the usual 
leaving groups for alcohols give rise to an amine linkage. For instance, 
alkyl sulfonates, such as tresylates (Nilsson et al . , 1984, Methods in 
Enzymology, 104, Jacoby, W. B., Ed., Academic Press: Orlando, 56-66; 

10 Nilsson et al . , 1987, Methods in Enzymology, 135; Mosbach, K. , Ed.; 
Academic Press: Orlando, 65-79; Scouten et al . , 1987, Methods in 
Enzymology, 135, Mosbach, K. , Ed., Academic Press: Orlando, 1987, 79-84; 
Crossland et al . , 1971, J. Amr. Chem. Soc, 93, 4217-4219), mesylates 
(Harris, (1985), supra; Harris et al . , 1984, J. Polym. Sci. Polym. Chem. 

15 Ed., 22, 341-352), aryl sulfonates like tosylates, and para-nitrobenzene 
sulfonates can be used. 

Organic sulfonyl chlorides, e.g. tresyl chloride, effectively 
converts hydroxy groups in a number of polymers, e.g. PEG, into good 
leaving groups (sulfonates) that, when reacted with nucleophiles like 

2 0 amino groups in polypeptides allow stable linkages to be formed between 

polymer and polypeptide. In addition to high conjugation yields, the 
reaction conditions are in general mild (neutral or slightly alkaline 
pH, to avoid denaturation and little or no disruption of activity) , and 
satisfy the non- destructive requirements to the polypeptide. 
25 Tosylate is more reactive than the mesylate but also more unstable 

decomposing into PEG, dioxane, and sulfonic acid (Zalipsky, 1995, 
Bioconjugate Chem., 6, 150-165). Epoxides may also been used for 
creating amine bonds but are much less reactive than the above mentioned 
groups . 

3 0 Converting PEG into a chlorof ormate with phosgene gives rise to 

carbamate linkages to lysines. This theme can be played in many variants 
substituting the chlorine with N-hydroxy succinimide (U.S. Patent No. 
5,122,614, (1992); Zalipsky et al . , 1992, Biotechnol . Appl . Biochem. , 
15, 100-114; Monfardini et al . , 1995, Bioconjugate Chem., 6, 62-69, with 
35 imidazole (Allen et al . , 1991, Carbohydr. Res., 213, 309-319), with 
para -nitrophenol , DMAP (EP 632 082, 1993, Looze, Y.) etc. The 
derivatives are usually made by reacting the chlorof ormate with the 
desired leaving group. All these groups give rise to carbamate linkages 
to the peptide. 
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Furthermore , isocyanates and isothiocyanates may be employed 
yielding ureas and thioureas, respectively. 

Amides may be obtained from PEG acids using the same leaving 
groups as mentioned above and cyclic imide thrones (U.S. Patent No. 
5 5,349,001 (1994), Greenwald et al . ) . The reactivity of these compounds 
is very high but may make the hydrolysis to fast. 

PEG succinate made from reaction with succinic anhydride can also 
be used. The hereby comprised ester group make the conjugate much more 
susceptible to hydrolysis (U.S. Patent No. 5,122,614, 1992, Zalipsky) . 
10 This group may be activated with N-hydroxy succinimide. 

Furthermore, a special linker can be introduced. The oldest being 
cyanuric chloride (Abuchowski et al . , 1977, J. Biol. Chem., 252, 
3578-3581; U.S. Patent No. 4,179,337, 1979, Davis et al . ; Shafer et al . , 
1986, J. Polym. Sci. Polym. Chem. Ed., 24, 375-378. 
15 Coupling of PEG to an aromatic amine followed by diazotization 

yields a very reactive diazonium salt which in situ can be reacted with 
a peptide. An amide linkage may also be obtained by reacting an 
azlactone derivative of PEG (U.S. Patent No. 5,321,095, 1994, Greenwald, 
R. B.) thus introducing an additional amide linkage. 

2 0 As some peptides do not comprise many lysines, it may be 

advantageous to attach more than one PEG to the same lysine. This can be 
done e.g. by the use of 1, 3-diamino-2-propanol . 

PEGs may also be attached to the amino-groups of the enzyme with 
carbamate linkages (WO 95/11924, Greenwald et al . ) . Lysine residues may 
25 also be used as the backbone. 

The coupling technique used in the examples is the N-succinimidyl 
carbonate conjugation technique described in WO 90/13590 (Enzon) . 

Method for preparing improved conjugates 

3 0 It is also an object of the invention to provide a method for 

preparing improved polypeptide -polymer conjugates comprising the steps 
of: 

a) identifying amino acid residues located on the surface of the 
3D structure of the parent polypeptide in question, 
35 b) selecting target amino acid residues on the surface of said 

3D structure of said parent polypeptide to be mutated, 

c) i) substituting or inserting one or more amino acid residues 
selected in step b) with an amino acid residue having a suitable 
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attachment group, and/or ii) substituting or deleting one or more amino 
acid residues selected in step b) at or close to the functional site(s), 
d) coupling polymeric molecules to the mutated polypeptide. 



5 Step a) Identifying amino acid residues located on the surface of the 
parent polypeptide 

3 -dimensional structure (3D-structure) 

To perform the method of the invention a 3 -dimensional structure 
of the parent polypeptide in question is required. This structure may 
10 for example be an X-ray structure, an NMR structure or a model-built 
structure. The Brookhaven Databank is a source of X-ray- and NMR- 
structures . 

A model-built structure may be produced by the person skilled in 
the art if one or more 3D-structure (s) exist (s) of homologous 
15 polypeptide (s) sharing at least 30% sequence identity with the 
polypeptide in question. Several software packages exist which may be 
employed to construct a model structure. One example is the Homology 
95.0 package from Biosym. 

Typical actions required for the construction of a model 

2 0 structure are: alignment of homologous sequences for which 3D- 

structures exist, definition of Structurally Conserved Regions (SCRs) , 
assignment of coordinates to SCRs, search for structural 
fragments/ loops in structure databases to replace Variable Regions, 
assignment of coordinates to these regions, and structural refinement 
25 by energy minimization. Regions containing large inserts (>3 
residues) relative to the known 3D-structures are known to be quite 
difficult to model, and structural predictions must be considered with 
care . 

Having obtained the 3D-structure of the polypeptide in question, 

3 0 or a model of the structure based on homology to known structures, 

this structure serves as an essential prerequisite for the fulfillment 
of the method described below. 

Step b) Selection of target amino acid residues for mutation 
35 Target amino acid residues to be mutated are according to the 

invention selected in order to obtain additional or fewer attachment 
groups, such as free amino groups (-NH 2 ) or free carboxylic acid groups 
(-COOH) , on the surface of the polypeptide and/or to obtain a more 
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complete and broadly spread shielding of the epitope (s) on the surface 
of the polypeptide. 

Conservative substitution 
5 It is preferred to make conservative substitutions in the 

polypeptide, as conservative substitutions secure that the impact of 
the mutation on the polypeptide structure is limited. 

In the case of providing additional amino groups this may be done 
by substitution of arginine to lysine, which are both positively 
10 charged, but only the lysine having a free amino group suitable as an 
attachment group. 

In the case of providing additional carboxylic acid groups the 
conservative substitution may for instance be an aspargine to aspartic 
acid or glutamine to glutamic acid substitution. These residues 
15 resemble each other in size and shape, except from the carboxylic 
groups being present on the acidic residues. 

In the case of providing fewer attachment groups, e.g. at or 
close to the active site, a lysine may be substituted with an 
arginine, and so on. 
20 Which amino acids to substitute depends in principle on the 

coupling chemistry to be applied. 

Non-conservative substitution 

The mutation may also be on target amino acid residues which are 
25 less/non-conservative. Such mutation is suitable for obtaining a more 
complete and broadly spread shielding of the polypeptide surface than 
can be obtained by the conservative substitutions. 

The method of the invention is first described in general terms, 
and subsequently using specific examples. 
3 0 Note the use of the following terms: 

Attachment_residue : residue (s) which can bind polymeric 
molecules, e.g. lysines (amino group) or aspartic/glutamic acids 
(carboxylic groups) . N- or Oterminal amino/ carboxylic groups are to 
be included where relevant . 
35 Mutation__residue : residue (s) which is to be mutated, e.g. 

arginine or aspargine/glutamine . 

Essential_catalytic_residues : residues which are known to be 
essential for catalytic function, e.g. the catalytic triad in serine 
proteases . 
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Solvent_exposed_residues : These are defined as residues which are 
at least 5% exposed according to the BIOSYM/ INSIGHT algorithm found in 
the module Homology 95.0. The sequence of commands is as follows: 
Homology=>ProStat=>Access_Surf =>Solv_Radius 1.4; Heavy atoms only; 
5 Radii source VdW; Output: Fractional Area; Polarity source: Default. 
The file f ilename_area . tab is produced. Note: For this program to 
function properly all water molecules must first be removed from the 
structure . 

It looks for example like: 



10 


# PD4 9 8 FINALMODEL 




# residue 


area 




TRP__1 


136.275711 




SER__2 


88 .188095 




PRO_3 


15.458788 


15 


ASN_4 


95 .322319 




ASP_5 


4 .903404 




PRO_6 


68.096909 




TYR_7 


93 .333252 




TYR_8 


31.791576 


20 


SER 9 


95.983139 



continued 

1. Identification of residues which are more than 10 Angstroms away 
from the closest attachment_residue, and which are located at least 8 

25 Angstroms away from essential_catalytic_residues . This residue subset 
is called REST, and is the primary region for conservative 
mutation_residue to attachment_residue substitutions. 

2. Identification of residues which are located in a 0-5 Angstroms 
3 0 shell around subset REST, but at least 8 Angstroms away from 

essential_catalytic_residues . This residue subset is called SUB5B. 
This is a secondary region for conservative mutation_residue to 
attachment_residue substitutions, as a ligand bound to an 
attachment_residue in SUB5B will extend into the REST region and 
35 potentially prevent epitope recognition. 

3. Identification of solvent_exposed mutation__residues in REST and 
SUB5B as potential mutation sites for introduction of 
attachment residues . 
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4. Use BIOS YM/ INSIGHT" s Biopolymer module and replace residues 
identified under action 3. 

5 5. Repeat 1-2 above producing the subset RESTx . This subset 
includes residues which are more than 10 Angstroms away from the 
nearest attachment_residue, and which are located at least 8 Angstroms 
away from essential catalytic residues. 

10 6. Identify solvent_exposed_residues in RESTx. These are potential 
sites for less/non- conservative mutations to introduce 
atttachment_residues . 

Step c) Substituting, inserting or deleting amino acid residues 
15 The mutation (s) performed in step c) may be performed by standard 

techniques well known in the art, such as site-directed mutagenesis 

(see, e.g., Sambrook et al . , 1989, Molecular Cloning. A Laboratory 

Manual, Cold Spring Harbor, NY. 

A general description of nucleotide substitution can be found in 
20 e.g. Ford et al., 1991, Protein Expression and Purification, 2, 95-107. 

Step d) Coupling polymeric molecules to the modified parent enzyme 

Polypeptide -polymer conjugates of the invention may be prepared by 
any coupling method known in the art including the above mentioned 
25 techniques. 

Coupling of polymeric molecules to the polypeptide in question 

If the polymeric molecules to be conjugated with the polypeptide 
are not active it must be activated by the use of a suitable method. The 

3 0 polymeric molecules may be coupled to the polypeptide through a linker. 
Suitable linkers are well known to the skilled person. 

Methods and chemistry for activation of polymeric molecules as 
well as for conjugation of polypeptides are intensively described in the 
literature. Commonly used methods for activation of insoluble polymers 

35 include activation of functional groups with cyanogen bromide, 
periodate, glutaraldehyde, biepoxides , epichlorohydrin, divinylsulf one, 
carbodiimide, sulfonyl halides, trichlorotriazine etc. (see R.F. Taylor, 
1991, "Protein Immobilisation. Fundamentals and Applications", Marcel 
Dekker, N.Y.; S.S. Wong, 1992, "Chemistry of Protein Conjugation and 
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Cros si inking " , CRC Press, Boca Raton; G.T. Hermanson et al . , 1993, 
"Immobilized Affinity Ligand Techniques 11 , Academic Press, N.Y.). Some of 
the methods concern activation of insoluble polymers but are also 
applicable to activation of soluble polymers e.g. periodate, 
5 trichlorotriazine, sulf onylhalides, divinylsulf one, carbodiimide etc. 
The functional groups being amino, hydroxyl, thiol, carboxyl, aldehyde 
or sulfydryl on the polymer and the chosen attachment group on the 
protein must be considered in choosing the activation and conjugation 
chemistry which normally consists of i) activation of polymer, ii) 
10 conjugation, and iii) blocking of residual active groups. 

In the following a number of suitable polymer activation methods 
will be described shortly. However, it is to be understood that also 
other methods may be used. 

Coupling polymeric molecules to the free acid groups of enzymes 
15 can be performed with the aid of diimide and for example amino-PEG or 
hydrazino-PEG (Pollak et al., 1976, J. Amr. Chem. Soc, 98, 289-291) or 
diazoacetate/amide (Wong et al . , 1992, "Chemistry of Protein Conjugation 
and Crosslinking" , CRC Press). 

Coupling polymeric molecules to hydroxy groups are generally very 

2 0 difficult as it must be performed in water. Usually hydrolysis predomi- 

nates over reaction with hydroxyl groups. 

Coupling polymeric molecules to free sulfhydryl groups can be 
reached wih special groups like maleimido or the ortho-pyridyl 
disulfide. Also vinylsulfone (U.S. Patent No. 5,414,135 (1995), Snow et 
25 al . ) has a preference for sulfhydryl groups but is not as selective as 
the other mentioned. 

Accessible arginine residues in the polypeptide chain may be 
targeted by groups comprising two vicinal carbonyl groups. 

Techniques involving coupling electrophilically activated PEGs to 

3 0 the amino groups of lysines are also useful. Many of the usual leaving 

groups for alcohols give rise to an amine linkage. For instance, alkyl 
sulfonates, such as tresylates (Nilsson et al., 1984, Methods in 
Enzymology, 104, Jacoby, W. B., Ed., Academic Press: Orlando, 56-66; 
Nilsson et al . , (1987), Methods in Enzymology, 135; Mosbach, K. , Ed.; 
35 Academic Press: Orlando, 65-79; Scouten et al . , 1987, Methods in 
Enzymology, 135, Mosbach, K. , Ed., Academic Press: Orlando, 1987; 79-84; 
Crossland et al . , 1971, J. Amr. Chem. Soc, 1971, 93, 4217-4219), 
mesylates (Harris, 1985, supra; Harris et al . , 1984, J. Polym. Sci. 
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Polym. Chem. Ed., 22, 341-352), aryl sulfonates like tosylates, and 
para-nitrobenzene sulfonates can be used. 

Organic sulfonyl chlorides, e.g. tresyl chloride, effectively 
converts hydroxy groups in a number of polymers, e.g. PEG, into good 
5 leaving groups (sulfonates) that, when reacted with nucleophiles like 
amino groups in polypeptides allow stable linkages to be formed between 
polymer and polypeptide. In addition to high conjugation yields, the 
reaction conditions are in general mild (neutral or slightly alkaline 
pH, to avoid denaturation and little or no disruption of activity) , and 
10 satisfy the non- destructive requirements to the polypeptide. 

Tosylate is more reactive than the mesylate but also more unstable 
decomposing into PEG, dioxane, and sulfonic acid (Zalipsky, 1995, 
Bioconjugate Chem., 6, 150-165). Epoxides may also been used for 
creating amine bonds but are much less reactive than the above mentioned 
15 groups. 

Converting PEG into a chlorof ormate with phosgene gives rise to 
carbamate linkages to lysines. This theme can be played in many variants 
substituting the chlorine with N-hydroxy succinimide (U.S. Patent No. 
5,122,614 (1992); Zalipsky et al . , 1992, Biotechnol . Appl . Biochem., 15, 
20 100-114; Monfardini et al . , 1995, Bioconjugate Chem., 6, 62-69, with 
imidazole (Allen et al . , 1991, Carbohydr. Res., 213, 309-319), with 
para -nitrophenol , DMAP (EP 632 082, 1993, Looze, Y.) etc. The 
derivatives are usually made by reacting the chloroformate with the 
desired leaving group. All these groups give rise to carbamate linkages 
25 to the peptide. 

Furthermore, isocyanates and isothiocyanates may be employed 
yielding ureas and thioureas, respectively. 

Amides may be obtained from PEG acids using the same leaving 
groups as mentioned above and cyclic imide thrones (U.S. Patent No. 
30 5,349,001 (1994), Greenwald et al . ) . The reactivity of these compounds 
is very high but may make the hydrolysis to fast. 

PEG succinate made from reaction with succinic anhydride can also 
be used. The hereby comprised ester group makes the conjugate much more 
susceptible to hydrolysis (US patent no. 5,122,614, (1992), Zalipsky). 
3 5 This group may be activated with N-hydroxy succinimide. 

Furthermore, a special linker can be introduced. The oldest being 
cyanuric chloride (Abuchowski et al . , 1977, J. Biol. Chem., 252, 
3578-3581; U.S. Patent No. 4,179,337, 1979, Davis et al . ; Shafer et al . , 
1986, J. Polym. Sci . Polym. Chem. Ed., 24, 375-378). 
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Coupling of PEG to an aromatic amine followed by diazotization 
yields a very reactive diazonium salt which in situ can be reacted with 
a peptide. An amide linkage may also be obtained by reacting an 
azlactone derivative of PEG (U.S. Patent No. 5,321,095, (1994), 
5 Greenwald, R. B.) thus introducing an additional amide linkage. 

As some peptides do not comprise many lysines, it may be 
advantageous to attach more than one PEG to the same lysine. This can be 
done e.g. by the use of 1, 3-diamino-2-propanol . 

PEGs may also be attached to the amino-groups of the enzyme with 
10 carbamate linkages (WO 95/11924, Greenwald et al.). Lysine residues may 
also be used as the backbone. 

Addition of attachment groups 

Specific examples of PD498 variant-SPEG conjugates 
15 A specific example of a protease is the parent PD498 (WO 93/24623 

and SEQ ID NO: 2) . The parent PD498 has a molecular weight of 29 kDa. 



Lysine and arginine residues are located as follows: 



Distance from the active site 


Arginine 


Lysine 


0-5 Angstroms 


1 




5-10 Angstroms 






10-15 Angstroms 


5 


6 


15-20 Angstroms 


2 


3 


20-25 Angstroms 


1 


3 


Total 


9 


12 



The inventors examined which parent PD4 98 sites on the surface may 

2 0 be suitable for introducing additional attachment groups. 

A. Suitable conservative arginine to lysine substitutions in 
parent PD498 may be any of R51K, R62K, R121K, R169K, R250K, R28K, R190K. 

B. Suitable non- conservative substitutions in parent PD498 may be 
any of P6K, Y7K, S9K, A10K, Y11K, Q12K, D43K, Y44K, N45K, N65K, G87K, 

25 I88K, N209K, A211K, N216K, N217K, G218K, Y219K, S220K, Y221K, G262K. 

As there are no lysine residues at or close to the active site 
there is no need for removing any attachment group. 

PD498 variant-SPEG conjugates may be prepared using any of the 
above mentioned PD498 variants as the starting material by any 

3 0 conjugation technique known in the art for coupling polymeric molecules 

to amino groups on the enzyme. A specific example is described below. 
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Removal of attachment groups 

Specific examples of BPN' variant-SPEG conjugates 

A specific example of a protease having an attachment group in the 
active site is BPN" which has 11 attachment groups (plus an N-terminal 
5 amino group) : BPN' has a molecular weight of 28 kDa. 



Lysine and arginine residues are located as follows: 



Distance from the active site 


Arginine 


Lysine 


0-5 Angstroms 




1 


5-10 Angstroms 






10-15 Angstroms 


1 


4 


15-20 Angstroms 


1 


4 


20-25 Angstroms 




2 


Total 


2 
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The lysine residue located within 0-5 Angstroms of the active site 
can according to the invention advantageously be removed. Specifically 
10 this may be done by a K94R substitution. 

BPN" variant-SPEG conjugates may be prepared using the above 
mentioned BPN" variant as the starting material by any conjugation 
technique known in the art for coupling polymeric molecules to amino 
groups on the enzyme. 

15 

Addition and removal of attachment groups 

Specific example of SAVINASE® -S PEG conjugates 

As described in Example 2 parent SAVINASE® (von der Osten et al . , 
1993, Journal of Biotechnology, 28, 55+ and SEQ ID NO: 3) may according 
2 0 to the invention have added a number of amino attachment groups to the 
surface and removed an amino attachment group close to the active site. 

Any of the following substitutions in SAVINASE® are sites for 
mutagenesis: R10K, R19K, R45K, R145K, R170K, R186K and R247K. 

The substitution K94R is identified as a mutation suitable for 
2 5 preventing attachment of polymers close to active site. 

SAVINASE® variant-SPEG conjugates may be prepared using any of the 
above mentioned SAVINASE® variants as the starting material by any 
conjugation technique known in the art for coupling polymeric molecules 
to amino groups on the enzyme. 

30 
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Addition of attachment groups 

Specific examples of Humicola lanuginosa lipase variants -SPEG conjugates 
Specific examples of lipase variants with reduced immunogeni city- 
using the parent Huminocal lanuginosa DSM 4109 lipase (see SEQ ID NO: 6) 
5 as the backbone for substitutions are listed below. 

The parent unmodified Humicola lanuginosa lipase has 8 attachment 
groups including the N-terminal NH 2 group and a molecular weight of 
about 2 9 kDa . 

Suitable conservative arginine to lysine substitutions in the 
10 parent lipase may be any of R133K, R139K, R160K, R179K, R209K, R118K 
and R125K. 

Suitable non- conservative substitutions in the parent lipase may 
be any of ; 

A18K, G31K, T32K, N33K, G38K # A40K, D48K, T50K, E56K, D57K, S58K, 
15 G59K, V60K, G61K, D62K, T64K, L78K, N88K, G91K, N92K, L93K, S105K, 
G106K, V120K, P136K, G225K, L227K, V228K, P229K, P250K, F262K. 

Further suitable non-conservative substitution in the Humicola 
lanuginosa lipase include: E87K or D254K. 

Lipase variant -SPEG conjugates may be prepared using any of the 
2 0 above mentioned lipase variants as the starting material by any 
conjugation technique known in the art for coupling polymeric molecules 
to amino groups on the enzyme. A specific example is described below. 

In Example 12 below it is shown that a conjugate of the Humicola 
lanuginosa lipase variant with E87K+D254K substitutions coupled to S-PEG 
25 15,000 has reduced immunogenic response in Balb/C mice in comparison to 
the corresponding parent unmodified enzyme. 

Immunogenicity and Allergenicity 

" Immunogenic ity" is a broader term than "antigenicity" and 
30 "allergenicity", and expresses the immune system's response to the 

presence of foreign substances. Said foreign substances are called 

immunogens, antigens and allergens depending of the type of immune 

response they elicit. 

An "immunogen" may be defined as a substance which, when intro- 
35 duced into circulatory system of animals and humans, is capable of 

stimulating an immunologic response resulting in formation of 

immunoglobulin. 

The term "antigen" refers to substances which by themselves are 
capable of generating antibodies when recognized as a non-self molecule. 
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Further, an "allergen" may be defined as an antigen which may give 
rise to allergic sensitization or an allergic response by IgE antibodies 
(in humans, and molecules with comparable effects in animals) . 

5 Assessment of immunogenicity 

Assessment of the immunogenicity may be made by injecting animal 
subcutaneous ly to enter the immunogen into the circulation system and 
comparing the response with the response of the corresponding parent 
polypeptide. 

10 The "circulatory system" of the body of humans and animals means, 

in the context of the present invention, the system which mainly 
consists of the heart and blood vessels. The heart delivers the 
necessary energy for maintaining blood circulation in the vascular 
system. The circulation system functions as the organism's 

15 transportation system, when the blood transports 0 2 , nutritious matter, 
hormones, and other substances of importance for the cell regulation 
into the tissue. Further the blood removes C0 2 from the tissue to the 
lungs and residual substances to e.g. the kidneys. Furthermore, the 
blood is of importance for the temperature regulation and the defense 

2 0 mechanisms of the body, which include the immune system. 

A number of in vitro animal models exist for assessment of the 
immunogenic potential of polypeptides. Some of these models give a 
suitable basis for hazard assessment in man. Suitable models include a 
mice model. 

2 5 This model seeks to identify the immunogenic response in the form 

of the IgG response in Balb/C mice being injected subcutaneous ly with 
modified and unmodified polypeptides. 

Also other animal models can be used for assessment of the 
immunogenic potential . 

3 0 A polypeptide having "reduced immunogenicity" according to the 

invention indicates that the amount of produced antibodies, e.g. 
immunoglobulin in humans, and molecules with comparable effects in 
specific animals, which can lead to an immune response, is significantly 
decreased, when introduced into the circulatory system, in comparison to 
3 5 the corresponding parent polypeptide. 

For Balb/C mice the IgG response gives a good indication of the 
immunigenic potential of polypeptides. 
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Assessment of allergenicity 

Assessment of allergenicity may be made by inhalation tests, 
comparing the effect of intratracheal ly (into the trachea) administrated 
parent enzymes with the corresponding modified enzymes according to the 
5 invention. 

A number of in vivo animal models exist for assessment of the 
allegenicity of enzymes. Some of these models give a suitable basis for 
hazard assessment in man. Suitable models include a guinea pig model and 
a mouse model . These models seek to identify respiratory allergens as a 

10 function of elicitation reactions induced in previously sensitized 
animals. According to these models the alleged allergens are introduced 
intratracheally into the animals. 

A suitable strain of guinea pigs, the Dunkin Hartley strain, does 
not as humans, produce IgE antibodies in connection with the allergic 

15 response. However, they produce another type of antibody the IgGIA and 
IgGIB (see e.g. Prent0, ATLA, 19, 8-14, 1991), which are responsible for 
their allergenic response to inhaled polypeptides including enzymes. 
Therefore, when using the Dunkin Hartley animal model, the relative 
amount of IgGIA and IgGIB is a measure of the allergenicity level. 

2 0 The Balb/C mice strain is suitable for intratracheal exposure. 

Balb/C mice produce IgE as the allergic response. 

More details on assessing respiratory allergens in guinea pigs and 
mice are described by Kimber et al., 1996, Fundamental and Applied 
Toxicology, 33, 1-10. 
25 Other animals such as rats, rabbits etc. may also be used for 

comparable studies. 

Composition 

The invention relates to a composition comprising a polypeptide- 

3 0 polymer conjugate of the invention. 

The composition may be a pharmaceutical or industrial composition. 

The composition may further comprise other polypeptides, proteins 
or enzymes and/or ingredients normally used in e.g. detergents, includ- 
ing soap bars, household articles, agrochemicals, personal care prod- 
35 ucts, including skin care compositions, cleaning compositions for e.g. 
contact lenses, oral and dermal pharmaceuticals, composition use for 
treating textiles, compositions used for manufacturing food, e.g. 
baking, and feed etc. 
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Use of the polypeptide -polymer conjugate 

The invention also relates to the use of the method of the 
invention for reducing the immune response of polypeptides. 

It is also an object of the invention to use the polypeptide- 
5 polymer conjugate of the invention to reduce the allergenicity of 
industrial products, such as detergents, such as laundry, dish wash and 
hard surface cleaning detergents, and food or feed products. 

MATERIAL AND METHODS 
10 Materials 

Enzymes : 

PD498: Protease of subtilisin type shown in WO 93/24623. The sequence of 
PD498 is shown in SEQ ID NOS: 1 and 2. 
SAVINASE® (Available from Novo Nordisk A/S) 
15 Humicola lanuginosa lipase: Available from Novo Nordisk as LIPOLASE® and 
is further described in EP 305,216. The DNA and protein sequence is 
shown in SEQ ID NOS: 5 and 6, respectively. 

Strains : 

20 B. subtilis 309 and 147 are variants of Bacillus lentus, 

deposited with the NCIB and accorded the accession numbers NCIB 10309 

and 10147, and described in U.S. Patent No. 3,723,250 incorporated by 
reference herein. 

E. coli MC 1000 (M.J. Casadaban and S.N. Cohen (1980); J". Mol. 

25 Biol. 138 179-207), was made r",m + by conventional methods and is also 
described in US Patent Application Serial No. 039,298. 

Vectors : 

pPD498: E. coli - B. subtilis shuttle vector (described in US 
30 patent No. 5,621,089 under section 6.2.1.6) containing the wild-type 
gene encoding for PD498 protease (SEQ ID NO: 2) . The same vector is 
used for mutagenesis in E. coli as well as for expression in B. 
subtilis . 

35 General molecular biology methods : 

Unless otherwise mentioned the DNA manipulations and 
transformations were performed using standard methods of molecular 
biology (Sambrook et al . , 1989, Molecular cloning: A laboratory 
Manual, Cold Spring Harbor lab., Cold Spring Harbor, NY; Ausubel, F. 
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M. et al . (eds.) "Current protocols in Molecular Biology". John Wiley 
and Sons, 1995; Harwood, C. R., and Cutting, S. M. (eds.) "Molecular 
Biological Methods for Bacillus". John Wiley and Sons, 1990) . 

Enzymes for DNA manipulations were used according to the 
5 specifications of the suppliers. 



Materials, chemicals and solutions: 

Horse Radish Peroxidase labeled anti-rat-Ig (Dako, DK, P162, # 
031; dilution 1:1000). 
10 Mouse ant i- rat IgE (Serotec MCA193; dilution 1:200) . 

Rat anti-mouse IgE (Serotec MCA419; dilution 1:100). 

Biotin-labeled mouse anti-rat IgGl monoclonal antibody (Zymed 03- 
9140; dilution 1:1000) 

Biotin-labeled rat ant i -mouse IgGl monoclonal antibody (Serotec 
15 MCA336B; dilution 1:1000) 

Streptavidin-horse radish peroxidase (Kirkeg&rd & Perry 14-30-00; 
dilution 1:1000) . 

CovaLink NH 2 plates (Nunc, Cat# 459439) 

Cyanuric chloride (Aldrich) 
20 Acetone (Merck) 

Rat anti-Mouse IgGl, biotin (SeroTec, Cat# MCA336B) 

Streptavidin, peroxidase (KPL) 

Or t ho - Pheny lene - di amine ( OPD ) ( Kern- en - Tec ) 

H 2 0 2 , 30% (Merck) 
25 Tween 20 (Merck) 

Skim Milk powder (Difco) 

H 2 S0 4 (Merck) 

Buffers and Solutions: 
30 Carbonate buffer (0.1 M, pH 10 (1 liter)) Na 2 C0 3 10.60 g 
PBS (pH 7.2 (1 liter)) NaCl 8.00 g 

KC1 0.20 g 

K 2 HP0 4 1.04 g 

KH 2 P0 4 0.32 g 

35 Washing buffer PBS, 0.05% (v/v) Tween 20 

Blocking buffer PBS, 2% (wt/v) Skim Milk powder 

Dilution buffer PBS, 0.05% (v/v) Tween 20, 0.5% (wt/v) Skim Milk powder 
Citrate buffer (0.1 M, pH 5.0-5.2 (1 liter) ) NaCitrate 20.60 g 

Citric acid 6.30 g 



32 



Activation of CovaLink plates: 

Make a fresh stock solution of 10 mg cyanuric chloride per ml 
acetone . 

5 Just before use, dilute the cyanuric chloride stock solution into 

PBS, while stirring, to a final concentration of 1 mg/ml. 

Add 100 ml of the dilution to each well of the CovaLink NH2 
plates, and incubate for 5 minutes at room temperature. 
Wash 3 times with PBS. 
10 Dry the freshly prepared activated plates at 50°C for 30 minutes. 

Immediately seal each plate with sealing tape. 

Preactivated plates can be stored at room temperature for 3 weeks 
when kept in a plastic bag. 

15 Sodium Borate, borax (Sigma) 

3,3-Dimethyl glutaric acid (Sigma) 
CaCl 2 (Sigma) 

Tresyl chloride (2 , 2 , 2-trif louroethansulf onyl chloride) (Fluka) 
l-ethyl-3- (3-dimethylaminopropyl) carbodiimide (EDC) (Fluka) 
20 ^-Hydroxy succinimide (Fluka art. 56480)) 
Phosgene (Fluka art. 79380) 
Lactose (Merck 7656) 

PMSF (phenyl methyl sulfonyl flouride) from Sigma 
Succinyl -Alanine -Alanine - Prol ine - Phenylalanine -para-nit roani lide 
25 (Suc-AAPF-pNP) Sigma no. S-7388, Mw 624.6 g/mole. 

Coloring substrate: 

OPD: o-phenylene -diamine, (Kementec cat no. 4260) 

3 0 Test Animals: 

Dunkin Hartley guinea pigs (from Charles River, DE) 

Female Balb/C mice (about 20 grams) purchased from Bomholdtgaard, Ry, 
Denmark . 

3 5 Equipment : 

XCEL II (Novex) 

ELISA reader (UVmax, Molecular Devices) 
HPLC (Waters) 
PFLC (Pharmacia) 
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Superdex-75 column, Mono-Q, Mono S from Pharmacia, SW. 
SLT: Fotometer from SLT Lablnstruments 
Size-exclusion chromatograph (Spherogel TSK-G2000 SW) . 
Size-exclusion chromatograph (Superdex 200, Pharmacia, SW) 
5 Amicon Cell 



Enzymes for DNA manipulations 

Unless otherwise mentioned all enzymes for DNA manipulations, 
such as e.g. restriction endonucleases , ligases etc., are obtained 
10 from New England Biolabs, Inc. 

Methods 

ELISA procedure for determination of IgGx positive guinea pigs 

ELISA microtiter plates are coated with rabbit anti-PD498 1:8000 
15 in carbonate buffer and incubated overnight at 4°C. The next day the 
plates are blocked with 2% BSA for 1 hour and washed 3 times with PBS 
Tween 20 . 

1 microgram/ml PD498 is added to the plates and incubated for 1 
hour, then washed 3 times with PBS Tween 20. 
20 All guinea pig sera samples and controls are applied to the ELISA 

plates with 2 microliters sera and 98 microliters PBS, incubated for 1 
hour and washed 3 times with PBS Tween 20. 

Then goat ant i -guinea pig IgG x (1:4000 in PBS buffer (Nordic 
Immunology, 44-682)) is applied to the plates, incubated for 1 hour and 
25 washed with PBS tween 20. 

Alkaline phosphatase marked rabbit anti-goat 1:8000 (Sigma A4187) 
is applied and incubated for 1 hour, washed 2 times in PBS Tween20 and 1 
time with diethanol amine buffer. 

The marked alkaline phosphatase is developed using p-nitrophenyl 
3 0 phosphate for 30 minutes at 37°C or until appropriate color has 
developed. 

The reaction is stopped using stop medium (K 2 HP0 4 /HaH 3 buffer 
comprising EDTA (pH 10)) and read at OD 405/650 using an ELISA reader. 

Double blinds are included on all ELISA plates. 
3 5 Positive and negative sera values are calculated as the average 

blind values added 2 times the standard deviation. This gives an 
accuracy of 95%. 
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Determination of the molecule weight 

Electrophoretic separation of proteins was performed by standard 
methods using 4-20% gradient SDS poly acrylamide gels (Novex) . Proteins 
were detected by silver staining. The molecule weight was measured 
5 relative to the mobility of Mark-12® wide range molecule weight 
standards from Novex. 

Protease activity 

Analysis with Suc-Ala-Ala-Pro-Phe-pNa ; 
10 Proteases cleave the bond between the peptide and p-nitroaniline 

to give a visible yellow color absorbing at 405 nm. 

Buffer: e.g. Brit ton and Robinson buffer pH 8.3 

Substrate: 100 mg suc-AAPF-pNa is dissolved into 1 ml dimethyl 
sulfoxide (DMSO) . 100 microliters of this is diluted into 10 ml with 
15 Britton and Robinson buffer. 

The substrate and protease solution is mixed and the absorbance is 
monitored at 405 nm as a function of time and ABS 40 5 nm/min. The tempera- 
ture should be controlled (20-50°C depending on protease) . This is a 
measure of the protease activity in the sample. 

20 

Proteolytic Activity 

In the context of this invention proteolytic activity is 
expressed in Kilo NOVO Protease Units (KNPU) . The activity is 
determined relatively to an enzyme standard (SAVINASE®) , and the 

2 5 determination is based on the digestion of a dimethyl casein (DMC) 
solution by the proteolytic enzyme at standard conditions, i.e. 50°C / 
pH 8.3, 9 min. reaction time, 3 min. measuring time. A folder AF 220/1 
is available upon request to Novo Nordisk A/S, Denmark, which folder 
is hereby included by reference. 

30 A Glycine Unit (GU) is defined as the proteolytic enzyme activity 

which, under standard conditions, during a 15 -minute incubation at 

40°C, with N- acetyl casein as substrate, produces an amount of NH 2 - 
group equivalent to 1 mmole of glycine. 

Enzyme activity can also be measured using the PNA assay, 
35 according to reaction with the soluble substrate succinyl -alanine- 
alanine -proline -phenyl -alanine -para-nitrophenol , which is described in 
Rothgeb, T.M., Goodlander, B.D., Garrison, P.H., and Smith, L.A., 1988 
Journal of American Oil Chemists Society. 
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Fermentation of PD498 variants 

Fermentation of PD498 variants in B. subtilis are performed at 
30°C on a rotary shaking table (300 r.p.m.) in 500 ml baffled 
Erlenmeyer flasks containing 100 ml BPX medium for 5 days. In order 
5 to make an e.g. 2 liter broth 20 Erlenmeyer flasks are fermented 
s imul t aneous ly . 

Media: 

BPX: Composition (per liter) 



10 Potato starch 100 g 

Ground barley 50 g 

Soybean flour 20 g 

Na 2 HP0 4 X 12 H 2 0 9 g 

Pluronic 0.1 g 

15 Sodium caseinate 10 g 



The starch in the medium is liquefied with alpha-amylase and the 
medium is sterilized by heating at 120°C for 45 minutes. After 
sterilization the pH of the medium is adjusted to 9 by addition of 
NaHC0 3 to 0.1 M. 

20 

Purification of PD498 variants 

Approximately 1.6 liters of PD498 variant fermentation broth are 
centrifuged at 5000 rpm for 35 minutes in 1 liter beakers. The 
supernatants are adjusted to pH 7.0 using 10% acetic acid and filtered 

25 on Seitz Supra S100 filter plates. 

The filtrates are concentrated to approximately 400 ml using an 
Amicon CH2A UF unit equipped with an Amicon S1Y10 UF cartridge. The UF 
concentrate is centrifuged and filtered prior to absorption at room 
temperature on a Bacitracin affinity column at pH 7 . The PD498 variant 

3 0 is eluted from the Bacitracin column at room temperature using 25% 2- 
propanol and 1 M sodium chloride in a buffer solution with 0.01 
dimethylglutaric acid, 0.1 M boric acid and 0.002 M calcium chloride 
adjusted to pH 7 . 

The fractions with protease activity from the Bacitracin 

35 purification step are combined and applied to a 750 ml Sephadex G25 
column (5 cm diameter) equilibrated with a buffer containing 0.01 
dimethylglutaric acid, 0.1 M boric acid and 0.002 M calcium chloride 
adjusted to pH 6.0. 
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Fractions with proteolytic activity from the Sephadex G25 column 
are combined and applied to a 150 ml CM Sepharose CL 6B cation 
exchange column ( 5 cm diameter) equilibrated with a buffer containing 
0. 01 M dimethylglutaric acid, 0,1 M boric acid, and 0. 002 M calcium 
5 chloride adjusted to pH 6.0. 

The protease is eluted using a linear gradient of 0-0. 5 M sodium 
chloride in 1 liter of the same buffer. 

Protease containing fractions from the CM Sepharose column are 
combined and filtered through a 2 micron filter. 

10 

Balb/C mice IgG ELISA Procedure: 

The antigen is diluted to 1 mg/ml in carbonate buffer. 

100 ml is added to each well. 

The plates are coated overnight at 4°C. 
15 Unspecific adsorption is blocked by incubating each well for 1 

hour at room temperature with 200 ml blocking buffer. 

The plates are washed 3x with 300 ml washing buffer. 

Unknown mouse sera are diluted in dilution buffer, typically lOx, 
20x and 40x, or higher. 
20 100 ml is added to each well. 

Incubation is for 1 hour at room temperature. 

Unbound material is removed by washing 3x with washing buffer. 

The anti-Mouse IgGl antibody is diluted 2000x in dilution buffer. 

100 ml is added to each well. 
25 Incubation is for 1 hour at room temperature. 

Unbound material is removed by washing 3x with washing buffer. 

Streptavidine is diluted lOOOx in dilution buffer. 

100 ml is added to each well. 

Incubation is for 1 hour at room temperature. 
30 Unbound material is removed by washing 3x with 300 ml washing 

buffer. 

OPD ( 0. 6 mg/ml) and H 2 0 2 { 0. 4 ml/ml) is dissolved in citrate 
buffer. 

100 ml is added to each well. 
35 Incubation is for 10 minutes at room temperature. 

The reaction is stopped by adding 100 ml H 2 S0 4 . 

The plates are read at 492 nm with 620 nm as reference. 
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Immunization of mice 

Balb/C mice (20 grams) are immunized 10 times (intervals of 14 
days) by subcutaneous injection of the modified or unmodified 
polypeptide in question, respectively by standard procedures known in 
5 art. 



EXAMPLES 
Example 1 

Suitable substitutions in PD498 for addition of amino attachment 
10 groups ( -NH 2 ) 

The 3D structure of parent PD498 was modeled as described above 
based on 59% sequence identity with Thermitase® (2tec.pdb) . 

The sequence of PD498 is SEQ ID NO: 2. PD498 residue numbering is 
used, 1-280. 

15 The commands performed in Insight (BIOSYM) are shown in the 

command files makeKzone .bcl and makeKzone2 .bcl below: 



Conservative substitutions: 
makeKzone. bcl 
20 1 Delete Subset * 

2 Color Molecule Atoms * Specified Specification 55,0,255 

3 Zone Subset LYS :lys:NZ Static monomer/residue 10 Color_Subset 
255,255, 0 

4 Zone Subset NTERM :1:N Static monomer/residue 10 Color__Subset 
25 255,255,0 

5 #NOTE: editnextline ACTSITE residues according to the protein 

6 Zone Subset ACTSITE :3 9,72,226 Static monomer/residue 8 
Color_Subset 255,255,0 

7 Combine Subset ALLZONE Union LYS NTERM 

3 0 8 Combine Subset ALLZONE Union ALLZONE ACTSITE 

9 #NOTE: editnextline object name according to the protein 

10 Combine Subset REST Difference PD498FINALMODEL ALLZONE 

11 List Subset REST Atom Output_File restatom. list 

12 List Subset REST monomer/residue Output_File restmole . list 
35 13 Color Molecule Atoms ACTSITE Specified Specification 255,0,0 

14 List Subset ACTSITE Atom Output_File actsiteatom. list 

15 List Subset ACTSITE monomer/residue Output_File actsitemole . list 

16 # 

17 Zone Subset REST5A REST Static Monomer /Re si due 5 -Color Subset 
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18 Combine Subset SUB5A Difference REST5A ACTSITE 

19 Combine Subset SUB5B Difference SUB 5 A REST 

20 Color Molecule Atoms SUB5B Specified Specification 255,255,255 

21 List Subset SUB5B Atom Output_File subSbatom. list 

5 22 List Subset SUB5B monomer/residue Output_File subSbmole . list 

23 #Now identify sites for lys->arg substitutions and continue with 
makezone2 .bcl 

24 #Use grep command to identify ARG in restatom. list , 
sub5batom. list & accsiteatom. list 

10 

Comments : 

Lines 1-8: The subset ALL ZONE is defined as those residues which 
are either within 10 Angstroms of the free amino groups on lysines or 
the N- terminal, or within 8 Angstroms of the catalytic triad residues 
15 39, 72 and 226. 

Line 10: The subset REST is defined as those residues not 
included in ALLZONE . 

Lines 17-20: Subset SUB5B is defined as those residues in a 5 
Angstroms shell around REST, excluding residues within 8 Angstroms of 

2 0 the catalytic residues. 

Line 23-24: REST contains Arg62 and Argl69, SUB5B contains 
Arg51, Argl21, and Arg250. ACTSITE contains Argl03, but position 103 
is within 8 Angstroms from essential_catalytic_residues , and thus not 
relevant . 

25 The color codes are: (255,0,255) = magenta, (255,255,0) yellow, 

(255,0,0) red, and (255, 255, 255) = white. 

The substitutions R51K, R62K, R121K, R169K and R250K are 
identified in parent PD498 as suitable sites for mutagenesis. The 
residues are substituted below in section 2, and further analysis 

3 0 done : 

Non- conservative substitutions : 
makeKzone2 .bcl 

1 #sourcefile makezone2 .bcl Claus von der Osten 961128 
35 2 # 

3 #having scanned lists (grep arg command) and identified sites for 
lys->arg substitutions 

4 #NOTE: editnextline object name according to protein 

5 Copy Object -To_Clipboard -Displace PD498FINALMODEL newmodel 
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6 Biopolymer 

7 #NOTE: editnextline object name according to protein 

8 Blank Object On PD4 9 8 F I NALMODEL 

9 #NOTE: editnextlines with lys->arg positions 
5 10 Replace Residue newmodel:51 lys L 

11 Replace Residue newmodel:62 lys L 

12 Replace Residue newmodel:121 lys L 

13 Replace Residue newmodel:169 lys L 

14 Replace Residue newmodel:250 lys L 
10 15 # 

16 #Now repeat analysis done prior to arg->lys / now including 
introduced lysines 

17 Color Molecule Atoms newmodel Specified Specification 255,0,255 

18 Zone Subset LYSx newmodel : lys :NZ Static monomer/ residue 10 
15 Color_Subset 255,255,0 

19 Zone Subset NTERMx newmodel :1:N Static monomer/residue 10 
Color_Subset 255,255,0 

20 #NOTE: editnextline ACTSITEx residues according to the protein 

21 Zone Subset ACTSITEx newmodel : 39 , 72 , 22 6 Static monomer/residue 8 

2 0 Color_Subset 255,255,0 

22 Combine Subset ALLZONEx Union LYSx NTERMx 

23 Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 

24 Combine Subset RESTx Difference newmodel ALLZONEx 

25 List Subset RESTx Atom Output_File restxatom. list 

25 26 List Subset RESTx monomer /residue Output_File restxmole . list 

27 # 

28 Color Molecule Atoms ACTSITEx Specified Specification 255,0,0 

29 List Subset ACTSITEx Atom Output_File actsitexatom. list 

30 List Subset ACTSITEx monomer/residue Output_File 

3 0 actsitexmole . list 

31 # 

32 #read restxatom. list or restxmole . list to identify sites for 
(not_arg) ->lys subst . if needed 

3 5 Comments: 

Lines 1-15: Solvent exposed arginines in subsets REST and SUB5B 
are replaced by lysines. Solvent accessibilities are recalculated 
following arginine replacement. 
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Lines 16-23: The subset ALLZONEx is defined as those residues 
which are either within 10 Angstroms of the free amino groups on 
lysines (after replacement) or the N- terminal, or within 8 Angstroms 
of the catalytic triad residues 39, 72 and 226. 
5 Line 24-26: The subset RESTx is defined as those residues not 

included in ALLZONEx, i.e. residues which are still potential epitope 
contributors. Of the residues in RESTx, the following are >5% exposed 
(see lists below): 6-7, 9-12, 43-45, 65, 87-88, 209, 211, 216-221, 
262 . 

10 The following mutations are proposed in parent PD498: P6K, Y7K, 

S9K, A10K, Y11K, Q12K, D43K, Y44K, N45K, N65K, G87K, I88K, N209K, 
A211K, N216K, N217K, G218K, Y219K, S220K, Y221K, G262K. 
Relevant data for Example 1 : 
Solvent accessibility data for PD498MODEL: 

15 # PD498MODEL Fri Nov 29 10:24:48 MET 1996 
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TRPJL52 
ASNJL53 
LYS_154 
GLY_155 
5 ALA_156 
VAL_157 
VAL_158 
VAL_159 
ALA_160 

10 ALA_161 
ALA_162 
GLY_163 
ASNJL64 
ASP_165 

15 ASN_166 
VAL_167 
SER_168 
ARG_169 
THR_170 

2 0 PHE_171 

GLN_172 
PRO_173 
ALA_174 
SER_175 
25 TYR_176 
PRO_177 
ASN_178 
ALA_179 
ILE_180 

3 0 ALA_181 

VAL_182 
GLY_183 
ALA_184 
ILE_185 

35 ASP_186 
SER_187 
ASN_188 
ASP_189 
ARG_190 

40 LYS__191 
ALA_192 
SER_193 
PHE_194 
SER_195 

4 5 ASN_196 

TYR_197 
GLY_198 
THRJL99 
TRP__200 

50 VAL_201 
ASP_202 
VAL_203 
THR_204 
ALA_205 

55 PRO_206 
GLY_207 
VAL_208 
ASN 209 



43.133667 

112 .864975 

110.009468 

33 .352180 

3.493014 

1.048144 

2 .043953 

0 .000000 

0.537387 

10.872165 

7 .823834 

12.064573 

81.183388 

64.495300 

83 .457443 

68.516815 

78.799652 

116.937134 

57 .275074 

51.416462 

18.934589 

1.880854 

6.522357 

26.184139 

21.425076 

85.613541 

34 .700817 

0.268693 

1.074774 

3 .761708 

0.000000 

2 .149547 

0.951118 

0 .806080 
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150 .050873 
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1.400973 
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12 . 141763 
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3.409728 

0.000000 

0.000000 

0.000000 

37.137192 

78.286270 
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Subset REST: 

restmole .list 

2 0 Subset REST: 

PD498FINALMODEL: 6-7, 9-12,43-46, 61-63,65, 87-89, 111-114, 117-118, 131, 
PD498FINALMODEL: 13 7 -13 9, 158-159, 169-171, 173-174, 180-181, 209,211, 
PD498FINALMODEL:216-221,232-233,262,E282H 
restatom. list 
25 Subset REST: 

PD4 9 8 F INALMODEL : PRO 6 : N , CA , CD , C , 0 , CB , CG 

PD498FINALMODEL : TYR 7 : N, CA, C, O, CB, CG, CD1 , CD 2 , CE1 , CE2 , CZ , OH 

PD4 9 8 FINALMODEL : SER 9 : N, CA, C, 0, CB , OG 

PD4 9 8 FINALMODEL : ALA 10 : N, CA, C, O, CB 

30 PD4 9 8 FINALMODEL : TYR 11 :N, CA, C, 0, CB, CG, CD1 , CD 2 , CE1 , CE2 , CZ , OH 

PD4 9 8 F INALMODEL : GLN 1 2 : N , CA , C , O , CB , CG , CD , OE 1 , NE 2 

PD4 9 8 FINALMODEL : ASP 43 : N, CA, C, 0, CB, CG, OD1 , OD2 

PD4 98 FINALMODEL : TYR 44 : N, CA, C, 0, CB , CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 

PD4 9 8 FINALMODEL :ASN 45 : N, CA, C, 0, CB , CG, OD1 , ND2 

3 5 PD4 9 8 FINALMODEL : HIS 46 :N, CA, C, 0, CB, CG, ND1 , CD2 , CE1 , NE2 

PD4 9 8 FINALMODEL : ASP 61 : N, CA, C, O, CB , CG, OD1 , OD2 

PD4 9 8 FINALMODEL: ARG 62 : N, CA, C, 0, CB , CG, CD, NE , CZ , NH1 , NH2 

PD4 9 8 FINALMODEL : AS P 6 3 : N , CA , C , O , CB , CG , OD 1 , OD2 

PD4 9 8 FINALMODEL : ASN 65 : N, CA, C, O, CB, CG, OD1 , ND2 

4 0 PD4 9 8 FINALMODEL :GLY 87:N,CA,C,0 

PD4 9 8 FINALMODEL : ILE 88 : N, CA, C, O, CB , CGI , CG2 , CD1 

PD4 9 8 FINALMODEL :GLY 89:N,CA,C,0 

PD4 9 8 FINALMODEL :GLY 111:N,CA,C,0 

PD4 9 8 FINALMODEL : SER 11 2 : N , CA , C , O , CB , OG 

45 PD4 9 8 FINALMODEL : LEU 113 : N, CA, C, O, CB, CG, CD1 , CD2 

PD4 9 8 FINALMODEL : ASP 114 : N, CA, C, O, CB , CG, OD1 , OD2 

PD4 9 8 FINALMODEL: ALA 117 : N, CA, C, O, CB 

PD4 98 FINALMODEL : SER 118 : N, CA, C, O, CB, OG 

PD4 9 8 FINALMODEL : LEU 13 1 : N, CA, C, O, CB , CG, CD1 , CD2 

50 PD4 9 8 FINALMODEL : CYS 137 :N, CA, C, O, CB, SG 

PD4 9 8 F INALMODEL : GLU 1 3 8 : N , CA , C , O , CB , CG , CD , OE 1 , OE2 

PD4 9 8 F INALMODEL : CYS 1 3 9 : N , CA , C , O , CB , SG 

PD4 9 8 FINALMODEL: VAL 158 : N, CA, C, O, CB , CGI , CG2 

PD4 9 8 FINALMODEL : VAL 159 : N, CA, C, O, CB, CGI , CG2 

55 PD4 9 8 FINALMODEL: ARG 169 : N, CA, C, 0, CB, CG, CD , NE , CZ , NH1 , NH2 

PD498FINALMODEL :THR 170 : N, CA, C, 0, CB, OG1 , CG2 

PD 4 9 8 F I NALMODE L : PHE 17 1 : N, CA, C , O , CB , CG, CD1 , CD 2 , CE1 , CE2 , CZ 

PD4 9 8 FINALMODEL : PRO 173 : N, CA, CD, C , O, CB , CG 
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PD498FINALMODEL : ALA 174 : N, CA, C, O, CB 
PD498FINALMODEL : ILE 180 :N, CA, C, O, CB, CGI , CG2 , CD1 
PD4 9 8 FINALMODEL : ALA 181 : N, CA, C, O, CB 
PD498FINALMODEL : ASN 209 :N, CA, C, O, CB, CG, OD1 ,ND2 
5 PD4 9 8 FINALMODEL: ALA 211 :N, CA, C, 0, CB 

PD4 9 8 F I NALMODEL : ASN 2 1 6 : N , CA , C , O , CB , CG , OD 1 , ND2 
PD4 9 8 FINALMODEL: ASN 217 :N, CA, C # O, CB, CG, OD1 , ND2 
PD4 9 8 FINALMODEL : GLY 218:N,CA,C,0 

PD4 9 8 FINALMODEL : TYR 2 1 9 : N , CA , C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 
10 PD4 9 8 FINALMODEL : SER 220 :N, CA, C, O, CB , OG 

PD4 9 8 FINALMODEL : TYR 221 :N, CA, C, O, CB , CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 

PD4 9 8 FINALMODEL :VAL 232 :N, CA, C, O, CB, CGI , CG2 

PD4 9 8 F I NALMODEL : ALA 233 :N, CA, C, O, CB 

PD498FINALMODEL : GLY 262:N,CA,C,0 
15 PD4 9 8 FINALMODEL : CA E282H:CA 

Subset SUB5B: 

sub5bmole . list 
Subset SUB5B: 

2 0 PD498FINALMODEL:4-5,8,13-16,34-35,47-51, 53,64,83,85-86, 90-91,120- 

124, 

PD498FINALMODEL: 128-130, 140-141, 143-144, 147-148, 151-152,156-157, 
PD4 9 8 FINALMODEL: 165, 167-168, 172, 175-176, 178-179, 196, 2 00-205,208, 
PD498FINALMODEL:234-237,250,253-254,260-261,263-267, 272,E281H, 
2 5 PD4 9 8 FINALMODEL : E2 8 3H 



subSbatom. list 
Subset SUB5B: 

PD4 9 8 FINALMODEL : ASN 4 : N, CA, C , O , CB , CG, OD1 , ND2 

30 PD4 9 8 FINALMODEL : ASP 5 :N, CA, C, O, CB, CG, OD1 , OD2 

PD4 9 8 FINALMODEL : TYR 8 :N, CA, C, O, CB , CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 

PD4 9 8 FINALMODEL : TYR 13 : N , CA, C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 

PD4 9 8 FINALMODEL : GLY 14:N,CA,C,0 

PD4 9 8 FINALMODEL : PRO 1 5 : N , CA , CD , C , O , CB , CG 

3 5 PD4 9 8 F INALMODEL : GLN 1 6 : N , CA , C , O , CB , CG , CD , OE 1 , NE2 

PD4 9 8 FINALMODEL : THR 3 4 : N , CA , C , O , CB , OG1 , CG2 

PD4 9 8 FINALMODEL :VAL 35 :N, CA, C, O, CB, CGI , CG2 

PD4 9 8 FINALMODEL : PRO 47 :N, CA, CD, C, O, CB , CG 

PD4 9 8 F I NALMODE L : ASP 4 8 : N , CA, C , O , CB , CG , OD1 , OD2 

40 PD4 9 8 FINALMODEL : LEU 49 : N, CA, C, O, CB , CG, CD1 , CD2 

PD4 9 8 FINALMODEL: ALA 50 : N, CA, C, O, CB 

PD4 9 8 FINALMODEL :ARG 51 :N, CA, C, O, CB, CG, CD,NE, CZ,NH1,NH2 

PD498FINALMODEL : VAL 53 : N, CA, C, O, CB , CGI , CG2 

PD4 9 8 F INALMODEL : ASN 64 : N , CA, C , 0 , CB , CG , OD1 , ND2 

45 PD4 9 8 FINALMODEL: ASP 83 : N, CA, C, O, CB , CG, OD1 , OD2 

PD4 9 8 FINALMODEL : ASN 85 :N, CA, C, 0, CB , CG, OD1 , ND2 

PD4 9 8 FINALMODEL : ASN 8 6 : N , CA , C , 0 , CB , CG , OD1 , ND2 

PD4 9 8 FINALMODEL : VAL 90 :N, CA, C, 0, CB, CGI, CG2 

PD4 9 8 FINALMODEL: ALA 91 :N, CA, C, 0, CB 

5 0 PD4 9 8 F INALMODEL : ILE 12 0 : N , CA , C , O , CB , CGI , CG2 , CD1 

PD4 9 8 FINALMODEL :ARG 121 : N, CA, C, O, CB , CG, CD, NE, CZ , NH1 , NH2 

PD498FINALMODEL : TYR 122 : N, CA, C, O, CB , CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 

PD4 9 8 FINALMODEL: ALA 123 : N, CA, C, O, CB 

PD498FINALMODEL : ALA 124 : N, CA, C, O, CB 

55 PD498FINALMODEL : ALA 128 : N, CA, C, O, CB 

PD4 9 8 FINALMODEL : LYS 12 9 : N , CA, C , O , CB , CG , CD , CE , NZ 

PD4 98FINALMODEL : VAL 13 0 : N , CA, C, 0 , CB , CGI , CG2 

PD4 9 8 FINALMODEL : ASN 140:N,CA,C,O,CB,CG, OD1 ,ND2 
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PD4 98FINALMODEL : SER 141 : N, CA, C , O, CB , OG 
PD498FINALMODEL : THR 143 : N, CA, C, O, CB , OG1 , CG2 
PD4 9 8 F INALMODEL : LEU 144 : N, CA, C, 0, CB , CG, CD1 , CD2 
PD498FINALMODEL : ALA 147 : N, CA, C, 0, CB 
PD4 98FINALMODEL : VAL 148 : N, CA, C, O, CB , CGI , CG2 
PD498FINALMODEL : ALA 151 :N, CA, C , O , CB 

PD4 9 8 F INALMODEL : TRP 52 : N, CA, C , 0, CB , CG, CD1 , CD 2 , NE1 , CE2 , CE3 , CZ2 , 
CZ3,CH2 



PD4 9 8 FINALMODEL 


: ALA 


156 


:N, CA 


c 


fO 


CB 


PD4 9 8 F INALMODEL 


: VAL 


157 


:N, CA 


c 


rO 


CB , CGI , CG2 


PD4 9 8 F I NALMODEL 


: ASP 


165 


:N, CA, 


c 


fO 


CB CG OD1.0D2 


PD4 9 8 FINALMODEL 


: VAL 


167 


:N, CA, 


c 


fO 


CB CGI CG2 


PD4 9 8 F INALMODEL 


:SER 


168 


:N, CA, 


c 


,0 


CB,0G 


PD4 9 8 F INALMODEL 


:GLN 


172 


:N, CA, 


c 


fO 


CB CG CD OEl NE2 


PD4 9 8 FINALMODEL 


:SER 


175 


:N, CA, 


c 


rO, 


CB, OG 


PD4 9 8 FINALMODEL 


:TYR 


176 


:N, CA, 


c, 


r o, 


CB . CG , CD1 , CD2 , CE1 . CE2 CZ OH 


PD4 9 8 F INALMODEL 


: ASN 


178 


:N, CA, 


c, 


o, 


CB,CG,0D1,ND2 


PD4 9 8 F I NALMODEL 


: ALA 


179 


:N, CA, 


c, 


o, 


CB 


PD4 9 8 F INALMODEL 


: ASN 


196 


:N, CA, 


c, 


o, 


CB,CG,0D1,ND2 


PD4 9 8 FINALMODEL 


:TRP 


200 


:N, CA, 




o, 


CB, CG, CD1 , CD2 , NE1 , CE2 , CE3 , CZ2 , 


CZ3 , CH2 














PD4 9 8 FINALMODEL 


: VAL 


201 


:N, CA, 


c ( 


o, 


CB CGI CG2 


PD4 9 8 F INALMODEL 


: ASP 


202 


:N, CA, 




o, 


CB CG ODl 0D2 


PD4 9 8 FINALMODEL 


: VAL 


203 


:N, CA, 


c J 


o, 


CB CGI CG2 


PD4 9 8 F INALMODEL 


:THR 


204 


:N, CA, 


c, 


o, 


CB OGl CG2 


PD4 98 FINALMODEL 


; AT. A 


205 


:N, CA, 


c, 


o, 


CB 


PD4 9 8 F INALMODEL 


: VAL 


208 


:N, CA, 


c, 


o, 


CB CGI CG2 


PD4 9 8 FINALMODEL 


:GLY 


234 


:N,CA, 


c, 


0 




PD4 9 8 FINALMODEL 


:LEU 


235 


:N,CA, 


c, 


o, 


CB,CG,CD1,CD2 


PD4 9 8 FINALMODEL 


: ALA 


236 


:N,CA, 


c, 


o, 


CB 


PD4 9 8 FINALMODEL 


: ALA 


237 


:N,CA, 


c, 


0, 


CB 


PD4 9 8 FINALMODEL 


: ARG 


250 


:N,CA, 


c, 


o, 


CB / CG,CD,NE,CZ,NH1,NH2 


PD4 9 8 F INALMODEL 


:ILE 


253 


:N,CA, 


c, 


o, 


CB , CGI , CG2 , CD1 


PD4 9 8 F INALMODEL 


:GLU 


254 


:N,CA, 


c, 


o # 


CB,CG,CD,0E1,0E2 


PD4 9 8 FINALMODEL 


: ILE 


260 


:N,CA, 


c, 


o, 


CB,CG1,CG2,CD1 


PD4 9 8 FINALMODEL 


:SER 


261 


:N,CA, 


c, 


o, 


CB,0G 


PD4 9 8 FINALMODEL 


:THR 


263 


:N,CA, 


c, 


o, 


CB,0G1,CG2 


PD4 9 8 FINALMODEL 


:GLY 


264 


:N,CA, 


c, 


0 




PD4 9 8 FINALMODEL 


: THR 


265 


:N,CA, 


c, 


o, 


CB,0G1,CG2 


PD4 9 8 FINALMODEL 


:ASN 


266 


:N,CA, 


c, 


o, 


CB,CG,0D1,ND2 


PD4 9 8 FINALMODEL 


:PHE 


267 


:N,CA, 


c, 


o, 


CB , CG, CD1 , CD2 , CE1 , CE2 , CZ 


PD4 9 8 FINALMODEL 


:ILE 


272 


:N,CA, 


c, 


o, 


CB,CG1,CG2,CD1 



PD4 9 8 FINALMODEL : CA E281H:CA 
PD4 9 8 FINALMODEL : CA E283H:NA 

Subset ACTSITE: 

actsitemole . list 
Subset ACTSITE: 

PD498FINALMODEL:36-42, 57-60, 66-80, 100-110, 115-116, 119, 132-136,160- 

164, 

PD498FINALMODEL: 182-184, 194, 206-2 07,210,212-215,222-231 

actsiteatom. list 
Subset ACTSITE: 

PD498FINALMODEL : ALA 3 6 : N, CA, C, O, CB 

PD4 9 8 FINALMODEL : VAL 37 :N, CA, C, O, CB, CGI , CG2 

PD4 98 FINALMODEL : LEU 3 8 : N, CA, C , O , CB , CG , CD1 , CD2 

PD4 9 8 FINALMODEL : ASP 3 9 : N , CA , C , O , CB , CG , ODl , OD2 
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PD498FINALMODEL : SER 40 : N, CA, C, O, CB , OG 

PD498FINALMODEL : GLY 41:N,CA,C,0 

PD4 98FINALMODEL : VAL 42 :N, CA, C, 0, CB, CGI , CG2 

PD 4 9 8 F I NALMODE L : TYR 57 :N, CA, C, 0, CB, CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 

5 PD498FINALMODEL : ASP 58 : N, CA, C, O, CB, CG, 0D1 , 0D2 

PD4 9 8 F INALMODEL : PHE 59 :N, CA, C, O, CB , CG, CD1 , CD2 , CE1 , CE2 , CZ 

PD4 9 8 F INALMODEL : ILE 60 :N, CA, C, 0, CB, CGI , CG2 , CD1 

PD4 9 8 F I NALMODE L : PRO 66 :N, CA, CD, C, O, CB, CG 

PD498FINALMODEL : MET 67 :N, CA, C, O, CB, CG, SD, CE 

10 PD498FINALMODEL : ASP 68 :N, CA, C, O, CB , CG, OD1 , OD2 

PD4 9 8 F INALMODEL : LEU 6 9 : N , CA , C , O , CB , CG , CD 1 , CD 2 

PD498FINALMODEL : ASN 70 :N, CA, C, O, CB, CG, OD1 , ND2 

PD498FINALMODEL : GLY 71:N,CA,C,0 

PD498FINALMODEL : HIS 72 : N, CA, C, 0 , CB , CG, ND1 , CD 2 , CE1 , NE2 

15 PD4 98FINALMODEL : GLY 73:N,CA,C,0 

PD4 9 8 F INALMODEL : THR 7 4 : N , CA , C , O , CB , OG1 , CG2 

PD4 98FINALMODEL : HIS 75 : N, CA, C, O, CB , CG, ND1 , CD2 , CE1 , NE2 

PD4 98FINALMODEL : VAL 76 :N, CA, C, O, CB, CGI , CG2 

PD498FINALMODEL : ALA 77 :N, CA, C , 0 , CB 

20 PD498FINALMODEL : GLY 78:N,CA,C,0 

PD498FINALMODEL : THR 79 :N, CA, C, O, CB , OG1 , CG2 

PD4 9 8 FINALMODEL : VAL 8 0 : N , CA , C, 0 , CB , CGI , CG2 

PD498FINALMODEL : LEU 100 : N, CA, C, O, CB , CG, CD1 , CD2 

PD4 98FINALMODEL : ALA 101 : N, CA, C, O, CB 

25 PD4 9 8 FINALMODEL: VAL 102 : N, CA, C , O , CB , CGI , CG2 

PD4 9 8 FINALMODEL : ARG 1 0 3 : N , CA , C , O , CB , CG , CD , NE , CZ , NH 1 , NH2 

PD4 9 8 FINALMODEL : VAL 104 : N, CA, C , O, CB , CGI , CG2 

PD4 9 8 FINALMODEL : LEU 1 0 5 : N , CA , C , O , CB , CG , CD1 , CD2 

PD4 9 8 FINALMODEL : ASP 10 6 : N, CA , C , 0 , CB , CG , OD1 , OD2 

30 PD4 9 8 FINALMODEL : ALA 107 : N, CA, C , 0, CB 

PD4 9 8 FINALMODEL : ASN 1 0 8 : N , CA , C , O , CB , CG , OD1 , ND2 

PD4 9 8 FINALMODEL : GLY 109:N,CA,C,O 

PD4 9 8 FINALMODEL : SER 110 : N, CA, C , 0 , CB , OG 

PD4 9 8 FINALMODEL : SER 115 : N, CA, C , O, CB , OG 

3 5 PD4 9 8 FINALMODEL : ILE 116 : N, CA, C, O, CB , CGI , CG2 , CD1 

PD4 98FINALMODEL :GLY 119:N,CA,C,0 

PD4 9 8 FINALMODEL: ASN 132 : N, CA, C, O, CB , CG, OD1 , ND2 

PD4 9 8 FINALMODEL : LEU 1 3 3 : N , CA , C , 0 , CB , CG , CD 1 , CD2 

PD4 9 8 FINALMODEL : SER 134 : N, CA, C, 0, CB , OG 

40 PD4 9 8 FINALMODEL : LEU 13 5 : N, CA, C, 0, CB , CG, CD1 , CD2 

PD4 9 8 FINALMODEL: GLY 136:N,CA,C,0 

PD4 9 8 FINALMODEL: ALA 160 : N, CA, C , 0, CB 

PD4 9 8 FINALMODEL: ALA 161 : N, CA, C, O, CB 

PD4 9 8 FINALMODEL : ALA 162 :N, CA, C , O, CB 

45 PD4 9 8 FINALMODEL: GLY 163:N,CA,C,0 

PD4 9 8 FINALMODEL : ASN 1 64 : N , CA , C , 0 , CB , CG , OD1 , ND2 

PD4 98FINALMODEL : VAL 182 : N, CA, C, 0, CB , CGI , CG2 

PD498FINALMODEL :GLY 183:N,CA,C,0 

PD4 9 8 FINALMODEL: ALA 184 : N, CA, C, O, CB 

5 0 PD4 9 8 FINALMODEL : PHE 1 94 : N , CA , C , O , CB , CG , CD1 , CD 2 , CE1 , CE2 , CZ 

PD4 9 8 FINALMODEL : PRO 2 06 : N, CA, CD , C , O , CB , CG 

PD4 9 8 FINALMODEL: GLY 207:N,CA,C,O 

PD4 9 8 F INALMODEL : I LE 2 1 0 : N , CA , C , O , CB , CGI , CG2 , CD 1 

PD498FINALMODEL : SER 212 : N, CA, C, O, CB, OG 

55 PD4 9 8 FINALMODEL : THR 213 :N, CA, C, O, CB, OG1, CG2 

PD4 9 8 FINALMODEL : VAL 2 1 4 : N , CA , C , 0 , CB , CGI , CG2 

PD4 9 8 F INALMODEL : PRO 2 1 5 : N , CA , CD , C , O , CB , CG 

PD4 9 8 F INALMODEL : MET 2 2 2 : N , CA , C , O , CB , CG , SD , CE 
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PD 4 9 8 F I NALMODE L : SER 223 :N, CA, C, O, CB , OG 

PD4 98FINALMODEL : GLY 224:N / CA / C / 0 

PD 4 9 8 F I NALMODE L : THR 225:N / CA / C / 0 / CB, OG1 , CG2 

PD4 9 8 F INALMODEL : SER 226 :N, CA, C, O, CB, OG 

5 PD4 98 FINALMODEL : MET 227 :N, CA, C, O, CB, CG, SD, CE 

PD4 98 F INALMODEL: ALA 228 : N, CA, C, O, CB 

PD498FINALMODEL : SER 22 9 : N, CA, C, O, CB , OG 

PD4 9 8 FINALMODEL : PRO 230 :N, CA, CD, C, O, CB, CG 

PD4 98FINALMODEL : HIS 231 : N, CA, C, O, CB, CG, ND1 , CD2 , CE1 , NE2 

10 

Subset RESTx: 

restxmole . list 
Subset RESTX: 

NEWMODEL: 6-7, 9-12,43-46, 65, 87-89, 131, 173,209,211,216-221,232-233, 
15 NEWMODEL:262,E282H 

restxatom. list 
Subset RESTX: 

NEWMODEL : PRO 6 :N, CA, CD, C, O, CB, CG 
2 0 NEWMODEL : TYR 7 : N, CA, C, O, CB , CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 

NEWMODEL: SER 9 :N, CA, C, O, CB, OG 
NEWMODEL: ALA 10 : N, CA, C, 0, CB 

NEWMODEL : TYR 11 :N, CA, C, O, CB, CG, CD1 , CD2 , CE1 , CE2 , CZ, OH 

NEWMODEL : GLN 1 2 : N , CA , C , O , CB , CG , CD , OE 1 , NE2 
25 NEWMODEL : AS P 43 :N, CA, C, O, CB , CG, OD1 , OD2 

NEWMODEL : TYR 44 : N, CA, C, 0, CB , CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 

NEWMODEL : ASN 4 5 : N , CA , C , O , CB , CG , OD 1 , ND2 

NEWMODEL : HIS 46 : N, CA, C, O, CB, CG, ND1 , CD2 , CE1 , NE2 

NEWMODEL : ASN 6 5 : N , CA , C , 0 , CB , CG , OD1 , ND2 
30 NEWMODEL :GLY 87:N,CA,C,0 

NEWMODEL : I LE 8 8 : N , CA , C , 0 , CB , CGI , CG2 , CD 1 

NEWMODEL :GLY 89:N,CA,C,0 

NEWMODEL : LEU 13 1 : N , CA, C , 0 , CB , CG, CD1 , CD2 

NEWMODEL: PRO 173 : N, CA, CD, C, O, CB, CG 
35 NEWMODEL : ASN 209 :N, CA, C, 0, CB , CG, OD1 , ND2 

NEWMODEL: ALA 211 :N, CA, C, 0, CB 

NEWMODEL : ASN 2 1 6 : N , CA , C , 0 , CB , CG , OD 1 , ND2 

NEWMODEL : ASN 2 17 : N, CA, C , 0 , CB , CG, OD1 , ND2 

NEWMODEL :GLY 218:N,CA,C,0 
40 NEWMODEL: TYR 

219:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

NEWMODEL: SER 220 :N, CA, C, O, CB, OG 

NEWMODEL : TYR 221 : N, CA, C, 0, CB , CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 
NEWMODEL : VAL 232 :N, CA, C , O, CB , CGI , CG2 
45 NEWMODEL: ALA 233 :N, CA, C, 0, CB 

NEWMODEL :GLY 262:N,CA,C,0 
NEWMODEL : C A E282H:CA 



Example 2 

50 Suitable substitutions in SAVINASE® for addition of amino attachment 
groups ( -NH 2 ) 

The known X-ray structure of SAVINASE® was used to find where 
suitable amino attachment groups may is added (Betzel et al, 1992, J. 
Mol. Biol., 223, 427-445). 
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The 3D structure of SAVINASE® is available in the Brookhaven 
Databank as Isvn.pbd. A related subtilisin is available as lst3.pdb. 

The sequence of SAVINASE® is shown in SEQ ID NO: 3. The sequence 
numbering used is that of subtilisin BPN' , SAVINASE® having deletions 
5 relative to BPN'at positions 36, 56, 158-159 and 163-164. The active 
site residues (functional site) are D32, H64 and S221. 

The commands performed in Insight (BIOSYM) are shown in the 
command files makeKzone .bcl and makeKzone2 .bcl below: 

10 Conservative substitutions : 
makeKzone* bcl 
Delete Subset * 

Color Molecule Atoms * Specified Specification 255,0,255 
Zone Subset LYS :lys:NZ Static monomer/residue 10 Color_Subset 
15 255,255,0 

Zone Subset NTERM :el:N Static monomer/residue 10 Color_Subset 
255,255, 0 

#NOTE: editnextline ACTSITE residues according to the protein 

Zone Subset ACTSITE : e32 , e64 , e221 Static monomer/ residue 8 

20 Color_Subset 255,255,0 

Combine Subset ALLZONE Union LYS NTERM 

Combine Subset ALLZONE Union ALLZONE ACTSITE 

#NOTE: editnextline object name according to the protein 

Combine Subset REST Difference SAVI8 ALLZONE 

25 List Subset REST Atom Output_File restatom. list 

List Subset REST monomer/residue Output_File restmole . list 
Color Molecule Atoms ACTSITE Specified Specification 255,0,0 
List Subset ACTSITE Atom OutputJFile actsiteatom. list 
List Subset ACTSITE monomer/ residue Output_File actsitemole . list 

30 # 

Zone Subset REST5A REST Static Monomer /Re si due 5 -Color_Subset 
Combine Subset SUB5A Difference REST5A ACTSITE 
Combine Subset SUB5B Difference SUB 5 A REST 

Color Molecule Atoms SUB5B Specified Specification 255,255,255 
35 List Subset SUB5B Atom Output_File subSbatom. list 

List Subset SUB5B monomer/ residue Output_File sub5bmole . list 

#Now identify sites for lys->arg substitutions and continue with 

make zone 2 .bcl 
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#Use grep command to identify ARG in restatom. list , sub5batom. list & 
accsiteatom. list 

Comments : 

5 In this case of SAVINASE® REST contains the arginines ArglO, 

Argl70 and Arg 186, and SUB5B contains Argl9, Arg45, Argl45 and 
Arg247. 

These residues are all solvent exposed. The substitutions R10K, 
R19K, R45K, R145K, R170K, R186K and R247K are identified in SAVINASE® 
10 as sites for mutagenesis within the scope of this invention. The 
residues are substituted below in section 2, and further analysis 
done. The subset ACTSITE contains Lys94. 

The substitution K94R is a mutation removing lysine as attachment 
group close to the active site. 

15 

Non- conservative substitutions: 
makeKzone2 .bcl 

#sourcefile makezone2 .bcl Claus von der Osten 961128 
# 

2 0 #having scanned lists (grep arg command) and identified sites for lys- 
>arg substitutions 

#NOTE: editnextline object name according to protein 
Copy Object -To_Clipboard -Displace SAVI8 newmodel 
Biopolymer 

25 #NOTE: editnextline object name according to protein 
Blank Object On SAVI8 

#NOTE: editnextlines with lys->arg positions 

Replace Residue newmodel :el0 lys L 

Replace Residue newmodel : el70 lys L 
30 Replace Residue newmodel : el86 lys L 

Replace Residue newmodel :el9 lys L 

Replace Residue newmodel :e45 lys L 

Replace Residue newmodel : el45 lys L 

Replace Residue newmodel : e241 lys L 
35 # 

#Now repeat analysis done prior to arg->lys, now including introduced 
lysines 

Color Molecule Atoms newmodel Specified Specification 255,0,255 
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Zone Subset LYSx newmodel : lys :NZ Static monomer /re si due 10 
Color_Subset 255,255,0 

Zone Subset NTERMx newmodel : el :N Static monomer/residue 10 
Color_Subset 255,255,0 
5 #NOTE: editnextline ACTSITEx residues according to the protein 

Zone Subset ACTSITEx newmodel : e3 2 , e64 , e221 Static monomer/ residue 8 
Color_Subset 255,255,0 

Combine Subset ALLZONEx Union LYSx NTERMx 
Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 
10 Combine Subset RESTx Difference newmodel ALLZONEx 
List Subset RESTx Atom Output_File restxatom. list 
List Subset RESTx monomer/residue Output_File restxmole . list 
# 

Color Molecule Atoms ACTSITEx Specified Specification 255,0,0 
15 List Subset ACTSITEx Atom Output_File actsitexatom. list 

List Subset ACTSITEx monomer /residue Output_File actsitexmole . list 
# 

#read restxatom. list or restxmole . list to identify sites for 
(not_arg) ->lys subst. if needed 

20 

Comments : 

Of the residues in RESTx, the following are >5% exposed (see 
lists below): 5, 14, 22, 38-40, 42, 75-76, 82, 86, 103-105, 108, 133- 
135, 137, 140, 173, 204, 206, 211-213, 215-216, 269. The following 
25 mutations are proposed in SAVINASE®: P5K, P14K, T22K, T38K, H39K, 
P40K, L42K, L75K, N76K, L82K, P86K, S103K, V104K, S105K, A108K, A133K, 
T134K, L135K, Q137K, N140K, N173K, N204K, Q206K, G211K, S212K, T213K, 
A215K, S216K, N269K. 
Relevant data for Example 2 : 

3 0 Solvent accessibility data for SAVINASE® : 

# SAVI8NOH20 Fri Nov 29 13:32:07 MET 1996 

# residue area 
ALA_1 118.362808 
GLN_2 49.422764 

35 SER_3 61.982887 

VAL_4 71.620255 

PRO_5 21.737535 

TRP_6 58.718731 

GLYJ7 4.328117 

40 ILE_8 6.664074 

SER_9 60.175900 

ARG_10 70.928963 

VAL_11 2.686934 

GLN 12 72.839996 
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Subset REST: 

restmole . list 
30 Subset REST: 

SAVI8 :E5-E15 , E17-E18 , E22 , E38 -E4 0 , E42 -E43 , E73 -E76 , E82-E86 , E103 -E105 , 
SAVI8 :E108-E109,E111-E112,E115-E116,E122,E128-E144,E149-E150 / E156- 
E157, 

SAVI8 :E160-E162,E165-E168,E170-E171,E173,E180-E188,E190-E192 / E200, 
35 SAVI8 :E203-E2 04,E2 06,E211-E213,E215-E216,E227-E230 / E255-E259 / E261- 
E262, 

SAVI8:E267-E269 

restatom. list 
Subset REST: 
4 0 SAVI8:PR0 E5 : N, CD, CA, CG, CB , C , O 

SAVI8 :TRP E6 : N, CA, CD2 , CE2 , NE1 , CD1 , CG, CE3 , CZ3 , CH2 , CZ2 , CB , C, 0 
SAVI8:GLY E7:N,CA,C,0 

SAVI8 : ILE E8 : N, CA, CD1 , CGI , CB , CG2 , C, 0 

SAVI8 : SER E9 : N, CA, OG, CB , C , 0 
45 SAVI8:ARG E10 : N, CA, NH2 , NH1 , CZ , NE , CD, CG, CB , C, O 

SAVI8:VAL Ell :N, CA, CG2 , CGI , CB , C, 0 

SAVI8 :GLN E12 : N, CA, NE2 , 0E1 , CD, CG, CB , C, 0 

SAVI8:ALA E13 : N, CA, CB, C, O 

SAVI8:PRO E14 : N, CD , CA, CG, CB , C , 0 
50 SAVI8:ALA E15 : N , CA, CB , C , O 

SAVI8 : HIS E17 : N , CA, CD2 , NE2 , CE1 , ND1 , CG, CB , C , O 

SAVI8 :ASN E18 : N, CA, ND2 , OD1 , CG, CB , C , 0 

SAVI8:THR E22 : N, CA, CG2 , OG1 , CB , C, 0 

SAVI8:THR E38 : N, CA, CG2 , 0G1 , CB, C, 0 
55 SAVI8:HIS E39 : N, CA, CD2 , NE2 , CE1 , ND1 , CG, CB, C, O 

SAVI8:PRO E40 : N, CD , CA, CG, CB , C, 0 

SAVI8 :LEU E42 : N, CA, CD2 , CD1 , CG, CB , C , 0 

SAVI8:ASN E43 : N, CA, ND2 , OD1 , CG, CB , C, 0 



58 



SAVI8:ALA E73 :N, CA, CB, C, O 

SAVI8:ALA E74 : N, CA, CB , C # O 

SAVI8 : LEU E75 :N, CA, CD2 , CD1 , CG, CB, C, O 

SAVI8:ASN E76 :N, CA, ND2 , OD1 , CG, CB , C, O 
5 SAVI8:LEU E82 :N, CA, CD2 , CD1 , CG, CB , C, 0 

SAVI8:GLY E83:N,CA,C,0 

SAVI8:VAL £84:1^^^02,001,06,0,0 

SAVI8:ALA E85 :N, CA, CB, C, O 

SAVI8:PR0 E86 :N, CD, CA, CG, CB, C, O 
10 SAVI8:SER E103 : N, CA, OG, CB , C , 0 

SAVI8:VAL E104 :N, CA, CG2 , CGI , CB, C, O 

SAVI8:SER E105 : N, CA, 00, CB , C , O 

SAVI8:ALA E108 :N, CA, CB , C, 0 

SAVI8:GLN E109 :N, CA, NE2 , 0E1 , CD, CG, CB, C, O 
15 SAVI8 :LEU Elll :N, CA, CD2 , CD1 , CG, CB, C, O 

SAVI8 : GLU E112 : N, CA, 0E2 , 0E1 , CD , CG, CB , C , O 

SAVI8:GLY E115 :N, CA, C, O 

SAVI8 :ASN E116 : N, CA, ND2 , 0D1 , CG, CB , C , O 

SAVI8:ALA E122 :N , CA, CB , C , 0 
20 SAVI8:SER E128 : N, CA, OG, CB , C , 0 

SAVI8:PR0 E129 : N, CD , CA, CG, CB , C, 0 

SAVI8:SER E130 :N, CA, OG, CB, C, O 

SAVI8:PR0 E131 :N, CD, CA, CG, CB , C , 0 

SAVI8:SER E132 : N, CA, OG, CB, C, 0 
25 SAVI8:ALA E133 : N, CA, CB , C, 0 

SAVI8 : THR E134 :N, CA, CG2 , 0G1 , CB, C, 0 

SAVI8 : LEU E135 :N, CA, CD 2 , CD1 , CG, CB , C, 0 

SAVI 8 : GLU E136 : N, CA, 0E2 , 0E1 , CD, CG, CB, C, O 

SAVI 8 : GLN E137 :N, CA, NE2 , 0E1 , CD, CG, CB, C, O 
30 SAVI8:ALA E138 : N, CA, CB, C, 0 

SAVI8:VAL E139 :N, CA, CG2 , CGI , CB, C, 0 

SAVI 8 :ASN E140 : N, CA, ND2 , 0D1 , CG, CB , C, 0 

SAVI8:SER E141 : N, CA, OG, CB, C, 0 

SAVI8:ALA E142 : N, CA, CB, C, 0 
35 SAVI8:THR E143 :N, CA, CG2 , 0G1 , CB, C, 0 

SAVI8:SER E144 : N, CA, OG, CB , C, O 

SAVI 8 :VAL E149 : N, CA, CG2 , CGI , CB, C, 0 

SAVI8:VAL E150 : N, CA, CG2 , CGI , CB , C , 0 

SAVI8:SER E156 :N, CA, OG, CB, C, O 
40 SAVI8:GLY E157 :N, CA, C, O 

SAVI8:ALA E160 : N, CA, CB, C, 0 

SAVI8:GLY E161 : N, CA, C, O 

SAVI8:SER E162 : N, CA, OG, CB, C, O 

SAVI 8 : ILE E165 :N, CA, CD1 , CGI , CB, CG2 , C, O 
45 SAVI8:SER E166 : N, CA, OG, CB, C, 0 

SAVI 8 : TYR E167 :N, CA, OH, CZ , CD2 , CE2 , CE1 , CD1 , CG, CB, C, O 

SAVI8:PR0 E168 : N, CD, CA, CG, CB , C, 0 

SAVI 8 : ARG E170 :N, CA,NH2 , NH1, CZ,NE, CD, CG, CB, C, O 

SAVI 8 : TYR E171 :N, CA, OH, CZ, CD2 , CE2 , CE1 , CD1 , CG, CB, C, 0 

50 SAVI8:ASN E173 : N, CA, ND2 , OD1 , CG, CB, C, 0 
SAVI 8 : THR E180 :N, CA, CG2 , 0G1, CB, C, 0 
SAVI 8 :ASP E181 :N, CA, 0D2 , OD1 , CG, CB , C, O 
SAVI 8 : GLN E182 : N, CA, NE2 , OE1 , CD, CG, CB, C, O 
SAVI 8 :ASN E183 : N, CA, ND2 , 0D1 , CG, CB , C, 0 

55 SAVI8:ASN E184 :N, CA, ND2 , 0D1, CG, CB, C, O 
SAVI 8 : ASN E185 : N, CA, ND2 , 0D1 , CG, CB, C, 0 
SAVI 8 : ARG E186 : N, CA, NH2 , NH1 , CZ , NE , CD, CG, CB , C, 0 
SAVI8:ALA E187 : N, CA, CB, C, O 



59 



SAVI8:SER E188 :N, CA, OG, CB, C, O 

SAVI8:SER E190 : N, CA, OG, CB , C , O 

SAVI 8 : GLN E191 : N, CA, NE2 , OE1 , CD, CG, CB, C, O 

SAVI 8 : TYR E192 : N, CA, OH, CZ , CD 2 , CE2 , CE1 , CD1 , CG, CB, C, O 
5 SAVI8:ALA E200 : N, CA, CB, C, O 

SAVI8:VAL E203 :N, CA, CG2 , CGI , CB, C, O 

SAVI 8 : ASN E2 0 4 : N , CA , ND2 , OD1 , CG , CB , C , O 

SAVI 8 : GLN E206 : N, CA, NE2 , OE1 , CD , CG, CB , C , O 

SAVI8:GLY E211 :N, CA, C, O 
10 SAVI8:SER E212 : N, CA, OG, CB , C, O 

SAVI8:THR E213 : N, CA, CG2 , OG1 , CB , C, 0 

SAVI8:ALA E215 : N, CA, CB, C, O 

SAVI8:SER E216 :N, CA, OG, CB , C, O 

SAVI8:VAL E22 7 : N, CA, CG2 , CGI , CB , C , O 
15 SAVI8:ALA E228 : N, CA, CB, C, O 

SAVI8:GLY E22 9 : N, CA, C , O 

SAVI8:ALA E23 0 : N, CA, CB, C, O 

SAVI 8 : THR E255 : N, CA, CG2 , OG1 , CB , C, O 

SAVI8:SER E256 : N, CA, OG, CB , C , O 

2 0 SAVI8:LEU E257 : N, CA, CD2 , CD1 , CG, CB, C, O 

SAVI8:GLY E258 : N, CA, C, O 
SAVI8:SER E259 :N, CA, OG, CB, C, O 
SAVI 8 :ASN E261 :N, CA, ND2 , OD1 , CG, CB, C, 0 
SAVI 8 : LEU E262 : N, CA, CD2 , CD1 , CG, CB , C, O 
25 SAVI8:LEU E267 : N, CA, CD2 , CD1 , CG, CB , C , 0 
SAVI8:VAL E268 : N, CA, CG2 , CGI , CB , C, 0 
SAVI8:ASN E269 :N, CA,ND2 , OD1, CG, CB, C,0 
Subset SUB5B: 

sub5bmole .list 

3 0 Subset SUB5B: 

SAVI 8 : E2 -E4 , E16 , E19-E21 , E23 -E24 , E2 8 , E37 , E41 , E44 -E45 , 
E77-E81,E87-E88, 

SAVI 8 :E90,E113-E114,E117-E118,E120-E121, E145-E148 ,E169,E172,E174-E176, 
SAVI 8 :E193-E196 / E198-E199,E214,E231-E234,E236,E243,E247,E250,E253- 
35 E254, 

SAVI 8 :E260 , E2 63 -E2 66 , E270-E273 , M276H-M277H 

sub5batom. list 
Subset SUB5B: 

SAVI 8 : GLN E2 : N, CA, NE2 , OE1 , CD , CG, CB , C , O 
40 SAVI8:SER E3 : N, CA, OG, CB , C , O 

SAVI8:VAL E4 : N, CA, CG2 , CGI , CB , C , O 
SAVI8:ALA E16 : N, CA, CB , C , O 

SAVI 8 : ARG E19 :N, CA, NH2 , NH1 , CZ , NE, CD, CG, CB , C, O 

SAVI8:GLY E20:N,CA,C,O 
45 SAVI8:LEU E21 :N, CA, CD2 , CD1 , CG, CB, C, 0 

SAVI8:GLY E23:N,CA,C,0 

SAVI8:SER E24 :N, CA, OG, CB, C, 0 

SAVI8:VAL E28 :N, CA, CG2 , CGI , CB , C, 0 

SAVI8:SER E37 :N, CA, OG, CB , C, 0 
50 SAVI8:ASP E41 :N, CA, OD2 , OD1 , CG , CB, C , O 

SAVI 8 : ILE E44 : N, CA, CD1 , CGI , CB , CG2 , C, O 

SAVI 8 : ARG E45 :N, CA, NH2 , NH1, CZ ,NE, CD, CG, CB, C, O 

SAVI 8 :ASN E77 :N, CA, ND2 , OD1 , CG, CB, C, O 

SAVI8:SER E78 : N, CA, OG, CB , C, O 
55 SAVI8:ILE E79 :N, CA, CD1 , CGI , CB, CG2 , C, O 

SAVI8:GLY E80:N,CA,C,O 

SAVI8:VAL E81 :N, CA, CG2 , CGI , CB, C, O 

SAVI8:SER E87 :N, CA, OG, CB , C, O 
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SAVI8:ALA E88 :N, CA, CB, C, O 

SAVI8 : LEU E90 : N # CA, CD2 , CD1 , CG, CB , C , O 

SAVI8 : TRP E113 : N, CA, CD 2 , CE2 , NE1 , CD1 , CG, CE3 , CZ3 , CH2 , CZ2 , CB , C , O 
SAVI8:ALA E114 :N, CA, CB, C, O 
5 SAVI8:ASN E117 : N, CA, ND2 , OD1 , CG, CB , C, 0 
SAVI8:GLY E118 : N, CA, C, O 

SAVI8:HIS E120 : N, CA, CD2 , NE2 , CE1 , ND1 , CG, CB, C, O 

SAVI8:VAL E121 :N, CA, CG2 , CGI , CB , C, 0 

SAVI8:ARG E145 : N, CA, NH2 , NH1 , CZ , NE , CD, CG, CB , C, 0 
10 SAVI8:GLY E146 : N, CA, C , O 

SAVI8:VAL E147 :N, CA, CG2 , CGI , CB, C, 0 

SAVI8 : LEU E148 : N, CA, CD2 , CD1 , CG, CB, C, O 

SAVI8:ALA E169 : N, CA, CB , C, O 

SAVI8:ALA E172 :N, CA, CB, C, O 
15 SAVI8:ALA E174 :N, CA, CB, C , O 

SAVI8:MET E175 :N, CA, CE, SD, CG, CB, C, 0 

SAVI8:ALA E176 :N, CA, CB, C, O 

SAVI8:GLY E193:N,CA,C,0 

SAVI8:ALA El 94 :N, CA, CB , C , O 
20 SAVI8:GLY E195 : N, CA, C, 0 

SAVI8 : LEU E196 : N, CA, CD2 , CD1 , CG, CB, C, 0 

SAVI8 : ILE E198 : N, CA, CD1 , CGI , CB, CG2 , C, 0 

SAVI8:VAL E199 : N , CA ; CG2 , CGI , CB , C , 0 

SAVI8 : TYR E214 :N, CA, OH, CZ, CD2 , CE2 , CE1 , CD1 , CG, CB, C, O 
25 SAVI8:ALA E231 : N, CA, CB , C, O 

SAVI8:ALA E232 : N, CA, CB , C , O 

SAVI8 : LEU E233 : N, CA, CD2 , CD1 , CG, CB , C, O 

SAVI8:VAL E234 : N, CA, CG2 , CGI , CB , C, 0 

SAVI8 : GLN E236 :N, CA, NE2 , OE1 , CD, CG, CB, C, O 
30 SAVI8:ASN E243 : N, CA, ND2 , OD1 , CG, CB , C, 0 

SAVI8:ARG E247 :N, CA, NH2 , NH1 , CZ , NE, CD, CG, CB , C, O 

SAVI 8 : LEU E2 5 0 : N , CA , CD2 , CD 1 , CG , CB , C , 0 

SAVI8 : THR E253 :N, CA, CG2 , OG1 , CB , C, 0 

SAVI8:ALA E254 : N, CA, CB , C, O 
35 SAVI8:THR E260 :N, CA, CG2 , OG1 , CB , C, O 

SAVI 8 : TYR E263 : N, CA, OH, CZ , CD 2 , CE2 , CE1 , CD1 , CG, CB , C, O 

SAVI8:GLY E264 : N, CA, C , 0 

SAVI8:SER E265 :N, CA, OG, CB, C, 0 

SAVI8:GLY E266 : N, CA, C, O 
40 SAVI8:ALA E270 :N, CA, CB, C, O 

SAVI8:GLU E271 :N, CA, OE2 , OE1 , CD , CG, CB, C, 0 

SAVI8:ALA E272 :N, CA, CB , C, O 

SAVI8:ALA E273 : N, CA, CB , C, O 

SAVI8:ION M276H:CA 
45 SAVI8:ION M277H:CA 

Subset ACTSITE: 

actsitemole . list 

Subset ACTSITE: 

SAVI 8 : E29-E35 , E48 -E51 , E54 , E58-E72,E91-E102,E106-E107,E110,E123-E127, 
50 SAVI 8 : E151-E155 , E177-E179 , E189 , E201-E202 , E205 , E207-E210 , E217-E226 

actsiteatom. list 
Subset ACTSITE: 

SAVI8:ALA E29 : N, CA, CB , C , O 
55 SAVI8:VAL E30 :N, CA, CG2 , CGI , CB , C, O 

SAVI 8 : LEU E31 :N, CA, CD2 , CD1 , CG, CB , C,0 
SAVI8:ASP E32 :N, CA, OD2 , OD1 , CG, CB , C, O 
SAVI8:THR E33 : N, CA, CG2 , OG1 , CB , C , O 
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SAVI8:GLY E34:N,CA,C,0 

SAVI8 : ILE E3 5 : N, CA, CD1 , CGI , CB , CG2 ,0,0 
SAVI8:ALA E48 : N, CA, CB , C, 0 
SAVI8:SER E4 9 : N, CA, OG, CB , C , 0 
5 SAVI8:PHE E50 : N, CA, CD2 , CE2 , CZ , CE1 , CD1 , CG, CB , C, 0 

SAVI8:VAL E51 : N, CA, CG2 , CGI , CB , C, 0 
SAVI 8 : GLU E5 4 : N , CA , 0E2 , OE 1 , CD , CG , CB , C , O 
SAVI8 : THR E58 : N, CA, CG2 , 0G1 , CB , C, O 
SAVI 8 :GLN E59 :N, OA, NE2 , 0E1 , CD, CG, CB, C, O 
10 SAVI8:ASP E60 : N, CA, 0D2 , 0D1 , CG, CB , C, 0 

SAVI8:GLY E61:N,CA,C,0 

SAVI 8 :ASN E62 : N, CA, ND2 , 0D1 , CG, CB , C, O 
SAVI8:GLY E63:N,CA,C,0 

SAVI 8 : HIS E64 : N, CA, CD2 , NE2 , CE1 , ND1 , CG, CB , C , 0 
15 SAVI8:GLY E65:N,CA,C,0 

SAVI8:THR E66 : N, CA, CG2 , 0G1 , CB, C, O 

SAVI8:HIS E67 : N, CA, CD2 , NE2 , CE1 , ND1 , CG, CB , C, 0 

SAVI8:VAL E68 : N, CA, CG2 , CGI , CB , C, 0 

SAVI 8 : ALA E69 :N, CA, CB, C, O 
20 SAVI8:GLY E70:N,CA,C,O 

SAVI8:THR E71 :N, CA, CG2 , OGl , CB , C, O 

SAVI 8 : ILE E72 :N, CA, CD1 , CGI , CB, CG2 , C, O 

SAVI8:TYR E91 : N, CA, OH , CZ , CD2 , CE2 , CE1 , CD1 , CG, CB , C , O 

SAVI8:ALA E92 : N, CA, CB , C , O 
25 SAVI8:VAL E93 :N, CA, CG2 , CGI , CB , C, O 

SAVI 8 : LYS E 94 : N , CA , NZ , CE , CD , CG , CB , C , O 

SAVI8:VAL E95 : N, CA, CG2 , CGI , CB , C, O 

SAVI 8 : LEU E96 : N, CA, CD2 , CD1 , CG, CB , C, 0 

SAVI8:GLY E97:N,CA,C,0 
30 SAVI8:ALA E98 :N, CA, CB, C, 0 

SAVI8:SER E99 :N, CA, OG, CB , C, 0 

SAVI8:GLY E100 :N, CA, C, O 

SAVI8:SER E101 :N, CA, OG, CB, C, 0 

SAVI8:GLY E102 :N, CA, C, O 
35 SAVI8:SER E106 :N, CA, OG, CB , C, 0 

SAVI 8 : ILE E107 : N, CA, CD1 , CGI , CB, CG2 , C, O 

SAVI8:GLY E110 :N, CA, C, O 

S AV 1 8 : ASN E 1 2 3 : N , CA , ND 2 , 0D1 , CG , CB , C , O 

SAVI 8 : LEU E124 :N, CA, CD2 , CD1 , CG, CB, C, 0 
40 SAVI8:SER E125 :N, CA, OG, CB , C, 0 

SAVI 8 : LEU E126 :N, CA, CD2 , CD1 , CG, CB, C, O 

SAVI8:GLY E127 :N, CA, C, O 

SAVI8:ALA E151 :N, CA, CB , C , 0 

SAVI8:ALA El 5 2 :N,CA,CB, C,0 
45 SAVI8:SER E153 :N, CA, OG, CB, C, O 

SAVI8:GLY E154 :N, CA, C, O 

SAVI8:ASN E155 : N, CA, ND2 , OD1 , CG, CB , C, 0 

SAVI8:VAL E177 :N, CA, CG2 , CGI , CB, C, 0 

SAVI8:GLY E178 :N, CA, C, O 
50 SAVI8:ALA E179 :N, CA, CB , C , O 

SAVI 8 : PHE E18 9 : N , CA , CD 2 , CE2 , CZ , CE1 , CD1 , CG , CB , C , 0 

SAVI8:PRO E201 :N, CD, CA, CG, CB , C, O 

SAVI8:GLY E202 :N, CA, C, O 

SAVI8:VAL E205 :N, CA, CG2 , CGI , CB, C, 0 
55 SAVI8:SER E207 :N, CA, OG, CB, C, 0 

SAVI 8 : THR E2 08 : N , CA, CG2 , OGl , CB , C , 0 

SAVI 8 : TYR E209 :N, CA, OH, CZ , CD2 , CE2 , CE1 , CD1 , CG, CB, C, O 
SAVI8:PRO E210 :N, CD, CA, CG, CB , C, 0 
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SAVI8 : LEU E217 : N, CA, CD2 , CD1 , CG, CB , C , 0 
SAVI8:ASN E218 :N, CA, ND2 , 0D1 , CG, CB, C, O 
SAVI8:GLY E219 : N, CA, C, 0 
S AV 1 8 : THR E 2 2 0 : N , CA , CG2 , OG 1 , CB , C , 0 
5 SAVI8:SER E221 :N, CA, OG, CB, C, 0 

SAVI8:MET E222 :N, CA, CE, SD, CG, CB, C, 0 
SAVI8:ALA E223 : N, CA, CB , C, O 
SAVI 8 : THR E224 : N, CA, CG2 , 0G1 , CB, C, 0 
SAVI8:PR0 E225 : N, CD , CA, CG, CB , C , O 
10 SAVI8:HIS E226 :N, CA, CD2 , NE2 , CE1 , ND1 , CG, CB, C, O 

Subset RESTx: 

restxmole . list 
Subset RESTX: 

NEWMODEL :E5,E13-E14,E22,E38-E40,E42,E73-E76 / E82-E86,E103-E105, 
15 NEWMODEL :E108,E122,E133-E135,E13 7-E14 0,E149-E150,E173,E204,E206, 

NEWMODEL :E2 11 -E213,E215-E216,E227-E22 9, E258,E2 69 
restxatom. list 
Subset RESTX: 

NEWMODEL : PRO E5 :N, CD , CA, CG, CB, C ,0 
20 NEWMODEL : ALA E13 :N, CA, CB , C , 0 

NEWMODEL: PRO E14 :N, CD, CA, CG, CB, C, O 

NEWMODEL : THR E22 :N, CA, CG2 , OG1 , CB, C, O 

NEWMODEL : THR E38 :N, CA, CG2 , OG1 , CB , C, O 

NEWMODEL : HI S E3 9 : N , CA , CD2 , NE2 , CE 1 , ND1 , CG , CB , C , O 
25 NEWMODEL : PRO E40 :N, CD , CA, CG, CB , C , 0 

NEWMODEL: LEU E42 :N, CA, CD2 , CD1 , CG, CB, C, O 

NEWMODEL: ALA E73 :N, CA, CB , C , O 

NEWMODEL: ALA E74 :N, CA, CB , C, O 

NEWMODEL : LEU E7 5 : N , CA , CD2 , CD1 , CG , CB , C , O 
30 NEWMODEL :ASN E76 :N, CA, ND2 , OD1 , CG, CB, C, O 

NEWMODEL: LEU E82 :N, CA, CD2 , CD1 , CG, CB, C, O 

NEWMODEL :GLY E83:N,CA,C,0 

NEWMODEL :VAL E84 : N, CA, CG2 , CGI , CB , C , O 

NEWMODEL: ALA E85 :N, CA, CB , C, 0 
35 NEWMODEL: PRO E86 :N, CD, CA, CG, CB, C, 0 

NEWMODEL : SER E103 : N, CA, OG, CB, C , O 

NEWMODEL : VAL El 04 : N , CA, CG2 , CGI , CB , C , 0 

NEWMODEL: SER E105 : N , CA, OG, CB , C, 0 

NEWMODEL: ALA E108 : N, CA, CB , C, O 
40 NEWMODEL : ALA E122 : N, CA, CB , C , 0 

NEWMODEL: ALA E133 : N, CA, CB, C, 0 

NEWMODEL : THR E134 : N, CA, CG2 , OG1 , CB , C, 0 

NEWMODEL : LEU E135 : N, CA, CD2 , CD1 , CG, CB, C, 0 

NEWMODEL : GLN E137 : N , CA, NE2 , OE1 , CD , CG , CB , C , O 
45 NEWMODEL : ALA E138 :N, CA, CB, C, 0 

NEWMODEL : VAL E139 : N, CA, CG2 , CGI , CB , C, 0 

NEWMODEL : ASN E14 0 : N , CA, ND2 , OD1 , CG , CB , C , 0 

NEWMODEL : VAL El 4 9 : N , CA , CG2 , CGI , CB , C, 0 

NEWMODEL : VAL El 5 0 : N , CA , CG2 , CGI , CB , C , 0 
50 NEWMODEL : ASN E173 : N, CA, ND2 , OD1 , CG, CB , C , 0 

NEWMODEL : ASN E2 0 4 : N , CA , ND2 , OD1 , CG , CB , C , O 

NEWMODEL : GLN E2 0 6 : N , CA , NE 2 , OE 1 , CD , CG , CB , C , 0 

NEWMODEL : GLY E2 11 : N, CA, C, 0 

NEWMODEL : SER E212 : N, CA, OG, CB, C, 0 
55 NEWMODEL : THR E213 :N, CA, CG2 , OG1 , CB , C, 0 

NEWMODEL : ALA E215 : N, CA, CB , C, 0 

NEWMODEL: SER E216 : N, CA, OG, CB, C, 0 

NEWMODEL : VAL E227 : N, CA, CG2 , CGI , CB , C, 0 
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NEWMODEL : ALA E228 : N, CA, CB , C, 0 

NEWMODEL : GLY E229 : N, CA, C , O 

NEWMODEL : GLY E258 : N, CA, C, O 

NEWMODEL : ASN E269 : N, CA, ND2 , 0D1 , CG, CB , C , O 

5 

Example 3 

Suitable substitutions in PD498 for addition of carboxylic acid 
attachment groups (-C00H) 

The 3D structure of PD498 was modeled as described in Example 1. 
10 Suitable locations for addition of carboxylic attachment groups 
(aspartatic acids and glutamic acids) were found as follows. The 
procedure described in Example 1 was followed. The commands performed 
in Insight (BIOSYM) are shown in the command files makeDEzone .bcl and 
makeDEzone2 .bcl below: 

15 

Conservative substutitions : 
makeDE z one . be 1 
Delete Subset * 

Color Molecule Atoms * Specified Specification 255,0,255 

2 0 Zone Subset ASP :asp:od* Static monomer/residue 10 Color_Subset 

255,255,0 

Zone Subset GLU :glu:oe* Static monomer/residue 10 Color_Subset 
255,255, 0 

#N0TE: editnextline C-terminal residue number according to the protein 
25 Zone Subset CTERM :280:O Static monomer /residue 10 Color_Subset 
255, 255, 0 

#N0TE: editnextline ACTSITE residues according to the protein 

Zone Subset ACTSITE : 39, 72,226 Static monomer/ residue 8 Color_Subset 

255,255, 0 

3 0 Combine Subset ALLZONE Union ASP GLU 

Combine Subset ALLZONE Union ALLZONE CTERM 

Combine Subset ALLZONE Union ALLZONE ACTSITE 

#N0TE: editnextline object name according to the protein 

Combine Subset REST Difference PD4 9 8 F INALMODEL ALLZONE 

35 List Subset REST Atom Output_File restatom. list 

List Subset REST monomer /re si due Output_File restmole . list 
Color Molecule Atoms ACTSITE Specified Specification 255,0,0 
List Subset ACTSITE Atom Output_File actsiteatom. list 
List Subset ACTSITE monomer/ re si due Output_File actsitemole . list 

40 # 
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Zone Subset REST5A REST Static Monomer/Residue 5 -Color_Subset 
Combine Subset SUB5A Difference REST5A ACTSITE 
Combine Subset SUB5B Difference SUB5A REST 

Color Molecule Atoms SUB5B Specified Specification 255,255,255 
5 List Subset SUB5B Atom OutputJFile subSbatom. list 

List Subset SUB5B monomer /re si due Output_File subSbmole . list 
#Now identify sites for asn->asp & gln->glu substitutions and ... 
#continue with makezone2 .bcl . 

#Use grep command to identify asn/gln in restatom. list ... 
10 #sub5batom. list & accsiteatom. list 

Comments : 

The subset REST contains Gln33 and Asn245, SUB5B contains Glnl2 / 
Glnl26, Asn209, Gln242, Asn246, Gln248 and Asn266, all of which are 
15 solvent exposed. 

The substitutions Q12E or Q12D, Q33E or Q33D, Q126E or Q126D, 
N209D or N209E, Q242E or Q242D, N245D or N245E, N246D or N246E, Q248E 
or Q248D and N266D or N266E are identified in PD498 as sites for 
mutagenesis within the scope of this invention. Residues are 
20 substituted below in section 2, and further analysis done: 

Non-conservative substitutions: 
makeDEzone2 .bcl 

#sourcefile makezone2 .bcl Claus von der Osten 961128 
25 # 

#having scanned lists (grep gln/asn command) and identified sites for 
#asn->asp & gln->glu substitutions 

#NOTE: editnextline object name according to protein 
Copy Object -To_Clipboard -Displace PD498FINALMODEL newmodel 
3 0 Biopolymer 

#NOTE: editnextline object name according to protein 
Blank Object On PD4 9 8 FINALMODEL 

#NOTE: editnextlines with asn->asp & gln->glu positions 
Replace Residue newmodel: 33 glu L 
35 Replace Residue newmodel: 245 asp L 
Replace Residue newmodel: 12 glu L 
Replace Residue newmodel: 126 glu L 
Replace Residue newmodel :209 asp L 
Replace Residue newmodel: 242 glu L 



65 



Replace Residue newmodel : 246 asp L 
Replace Residue newmodel: 248 glu L 
Replace Residue newmodel : 266 asp L 
# 

5 #Now repeat analysis done prior to asn->asp & gln->glu, . . . 
#now including introduced asp & glu 

Color Molecule Atoms newmodel Specified Specification 255,0,255 
Zone Subset ASPx newmodel : asp : od* Static monomer/residue 10 
Color_Subset 255,255,0 
10 Zone Subset GLUx newmodel : glu toe* Static monomer/residue 10 
ColorjSubset 255,255,0 

#NOTE: editnextline C- terminal residue number according to the protein 
Zone Subset CTERMx newmodel : 280 : 0 Static monomer/residue 10 
Color_Subset 255,255,0 
15 #NOTE: editnextline ACTSITEx residues according to the protein 

Zone Subset ACTSITEx newmodel : 3 9 , 72 , 22 6 Static monomer/residue 8 
ColorjSubset 255,255,0 

Combine Subset ALLZONEx Union ASPx GLUx 
Combine Subset ALLZONEx Union ALLZONEx CTERMx 
2 0 Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 
Combine Subset RESTx Difference newmodel ALLZONEx 
List Subset RESTx Atom Output_File restxatom. list 
List Subset RESTx monomer/residue Output_File restxmole . list 
# 

25 Color Molecule Atoms ACTSITEx Specified Specification 255,0,0 
List Subset ACTSITEx Atom Output_File actsitexatom. list 
List Subset ACTSITEx monomer /residue Output_File actsitexmole . list 
# 

#read restxatom. list or restxmole . list to identify sites for 
30 (not_gluasp) ->gluasp . . . 
#subst. if needed 

Comments : 

The subset RESTx contains only two residues: A233 and G234, none 
35 of which are solvent exposed. No further mutagenesis is required to 
obtain complete protection of the surface. However, it may be 
necessary to remove some of the reactive carboxylic groups in the 
active site region to ensure access to the active site of PD498. 
Acidic residues within the subset ACTSITE are: D39, D58, D68 and D106. 
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Of these only the two latter are solvent exposed and D3 9 is a 

functional residue. The mutations D68N, D68Q, D106N and D106Q were 

found suitable according to the present invention. 

Relevant data for Example 3 : 

5 Solvent accessibility data for PD498MODEL : see Example 1 above. 

Subset REST: 

restmole . list 
Subset REST: 

PD498FINALMODEL: 10-11 ,33 -35 ,54-55, 129-130,221,233-234,236,240,243, 
10 PD498FINALMODEL : 245 , 262 , 264-265 

restatom. list 

Subset REST: 

PD4 98FINALMODEL : ALA 10 : N, CA, C, O, CB 
1 5 PD4 9 8 FINALMODEL : TYR 1 1 : N , CA , C , 0 , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 

PD4 9 8 F I NALMODEL : GLN 3 3 : N , CA , C , 0 , CB , CG , CD , OE 1 , NE 2 

PD4 9 8 FINALMODEL : THR 3 4 : N , CA , C , 0 , CB , OG1 , CG2 

PD4 9 8 FINALMODEL : VAL 3 5 : N, CA, C , O , CB , CGI , CG2 

PD4 9 8 FINALMODEL : ILE 54 : N, CA, C , 0 , CB , CGI , CG2 , CD1 
20 PD4 98FINALMODEL : LYS 55 : N, CA, C, 0, CB , CG, CD, CE , NZ 

PD4 98 F INALMODEL : LYS 1 2 9 : N , CA , C , O , CB , CG , CD , CE , NZ 

PD49 8 FINALMODEL : VAL 1 3 0 : N , CA , C , O , CB , CGI , CG2 

PD4 9 8 FINALMODEL : TYR 221 :N, CA, C, O, CB, CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 

PD4 9 8 FINALMODEL: ALA 233 :N, CA, C, O, CB 
25 PD498FINALM0DEL : GLY 234:N,CA,C,0 

PD4 9 8 FINALMODEL: ALA 236 :N, CA, C, O, CB 

PD4 98FINALMODEL : ALA 240 :N, CA, C, O, CB 

PD4 9 8 FINALMODEL : GLY 243:N,CA,C,0 

PD4 9 8 FINALMODEL : ASN 2 4 5 : N , CA , C , O , CB , CG , OD1 , ND2 
30 PD4 9 8 FINALMODEL : GLY 262:N,CA,C,0 

PD498FINALMODEL :GLY 264:N,CA,C,0 

PD4 9 8 FINALMODEL : THR 265:N,CA,C,0, CB, OG1 , CG2 
Subset SUB5B: 
sub5bmole . list 

3 5 Subset SUB5B : 

PD498FINALMODEL: 6 -9, 12-13, 31-32, 51-53, 56, 81,93-94,97-99, 122, 126-128, 
PD498FINALMODEL: 131, 155-157, 159, 197-199,209,211,219-220,232,235, 
PD498FINALMODEL: 23 7 -23 9, 241-242, 244, 246-249,253,260-261,263,266-268 
subSbatom. list 

4 0 Subset SUB5B: 

PD4 9 8 FINALMODEL : PRO 6 : N, CA, CD , C, 0, CB , CG 

PD4 9 8 FINALMODEL : TYR 7 : N, CA, C, O, CB, CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 

PD4 9 8 FINALMODEL : TYR 8 : N, CA, C , O , CB , CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 

PD49 8 FINALMODEL : SER 9 : N , CA , C , O , CB , OG 
45 PD498FI NALMODEL : GLN 1 2 : N , CA , C , O , CB , CG , CD , OE 1 , NE 2 

PD4 9 8 FINALMODEL : TYR 13 : N, CA, C, O, CB, CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 

PD4 9 8 FINALMODEL : SER 3 1 : N , CA , C , O , CB , OG 

PD4 9 8 F INALMODEL : THR 3 2 : N , CA , C , O , CB , OG1 , CG2 

PD4 9 8 FINALMODEL : ARG 5 1 : N , CA , C , O , CB , CG , CD , NE , CZ , NH 1 , NH2 
50 PD498FINALMODEL :LYS 52 :N, CA, C, O, CB, CG, CD, CE,NZ 

PD4 9 8 FINALMODEL: VAL 53 :N, CA, C, O, CB, CGI , CG2 

PD498FINALMODEL :GLY 56:N,CA,C,0 

PD498FINALMODEL : ALA 81 : N, CA, C , O, CB 

PD498 F INALMODEL : MET 9 3 : N , CA , C , O , CB , CG , SD , CE 
55 PD4 9 8 FINALMODEL: ALA 94 : N, CA, C , O, CB 

PD4 9 8 FINALMODEL : THR 97 :N,CA,C,0, CB,0G1,CG2 
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PD 4 9 8 F I NALMODE L : LYS 
PD4 98FI NALMODEL : I LE 
PD4 98FINALMODEL : TYR 
PD 4 9 8 F I NALMODE L : GLN 
5 PD498FINALMODEL : GLY 
PD4 9 8 F I NALMODEL : ALA 
PD4 98 FINALMODEL : LEU 
PD4 9 8 F INALMODEL : GLY 
PD 4 9 8 F I NALMODE L : ALA 

10 PD4 9 8 FINALMODEL : VAL 
PD498FI NALMODEL : VAL 
PD4 98 FINALMODEL : TYR 
PD4 9 8 F INALMODEL : GLY 
PD4 98 FINALMODEL : THR 

1 5 PD4 98 FINALMODEL : ASN 
PD4 9 8 FINALMODEL : ALA 
PD4 98FINALMODEL : TYR 
PD4 9 8 FINALMODEL : SER 
PD4 9 8 F INALMODEL : VAL 

2 0 PD4 98 FINALMODEL : LEU 
PD4 9 8 FINALMODEL : ALA 
PD498FINALMODEL : LEU 
PD4 9 8 F INALMODEL : LEU 
PD 4 9 8 F I NALMODE L : SER 

2 5 PD4 9 8 FINALMODEL : GLN 

PD49 8 FINALMODEL : LYS 
PD4 9 8 FINALMODEL :ASN 
PD4 9 8 FINALMODEL : VAL 
PD49 8 FINALMODEL : GLN 

3 0 PD4 98 FINALMODEL : ILE 

PD4 9 8 FINALMODEL : ILE 
PD4 9 8 FINALMODEL : ILE 
PD4 9 8 FINALMODEL : SER 
PD498 FINALMODEL : THR 

3 5 PD4 9 8 F INALMODEL : ASN 

PD4 9 8 FINALMODEL : PHE 
PD4 9 8 F INALMODEL : LYS 
Subset ACTSITE: 

actsitemole . li 

4 0 Subset ACTSITE: 

PD4 9 8 FINALMODEL: 

164, 

PD498FINALMODEL: 
actsiteatom. li 

45 Subset ACTSITE: 

PD4 9 8 FINALMODEL: 
PD4 9 8 FINALMODEL: 
PD4 9 8 FINALMODEL: 
PD498 FINALMODEL : 

50 PD4 9 8 FINALMODEL: 

PD4 9 8 FINALMODEL: 
PD4 9 8 FINALMODEL: 
PD4 9 8 FINALMODEL: 
PD4 9 8 FINALMODEL : 

55 PD4 9 8 FINALMODEL: 

PD4 9 8 FINALMODEL: 
PD4 9 8 FINALMODEL: 
PD4 9 8 FINALMODEL: 



9 8 : N , CA , C , 0 , CB , CG , CD , CE , NZ 
9 9 : N , CA , C , O , CB , CG 1 , CG2 , CD 1 



122 :N,CA 
126:N,CA 
127:N,CA 
128:N,CA 
131:N,CA 
155:N,CA 
156:N,CA 
157:N,CA 
159:N,CA 
197:N,CA 
198:N,CA 
199:N,CA 
209:N,CA 
211:N,CA 
219:N,CA 
220:N,CA 
232:N,CA 
235:N,CA 
237:N,CA 
238 :N,CA 
239:N,CA 
241:N,CA 
242 :N,CA 
244 :N,CA 
246 :N, CA 
247 :N,CA 
248 :N, CA 
249:N,CA 
253 :N,CA 
260 :N,CA 
261 :N,CA 
263 :N,CA 
266 :N,CA 
267 :N,CA 
268:N,CA 
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v- 1 


p 
vj 


p 
v- / 
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VJ 


p 
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p 
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c, 


0 


c, 


0 


c, 


0 


C, 


0 


C, 


0 


c, 


0 


c, 


0 


c, 


0 



CB , CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 
CB,CG,CD,OEl,NE2 

CB 

CB,CG,CD1,CD2 
CB 

CB / CG1 / CG2 
06,001,002 

CB , CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 

CB,0G1,CG2 
CB ,CG,0D1,ND2 
CB 

CB , CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 

CB,OG 

06,001,002 

CB / CG,CD1,CD2 

CB 

CB,CG,CD1,CD2 
CB,CG ,CD1,CD2 
CB,OG 

CB,CG,CD / 0E1,NE2 

CB,CG,CD,CE,NZ 

CB,CG,0D1,ND2 

CB,CG1,CG2 

CB,CG,CD,0E1,NE2 

CB,CG1 / CG2,CD1 

CB,CG1,CG2,CD1 

CB,CG1,CG2,CD1 

CB,OG 

CB,0G1,CG2 

CB,CG,0D1,ND2 

CB , CG, CD1 , CD2 , CE1 , CE2 , CZ 

CB,CG,CD,CE,NZ 



: 36-42, 57-60, 66-80, 100 -110, 115-116, 119, 132 -136, 160 - 

: 182 -184, 194,2 06-207,210,212-215,222-231 
st 

: ALA 36:N,CA,C,0,CB 

:VAL 37:N,CA,C,0,CB,CG1,CG2 

:LEU 38:N,CA,C,0,CB,CG,CD1,CD2 

:ASP 39:N,CA,C,0,CB,CG,ODl,OD2 

:SER 40 :N,CA,C,0,CB,OG 

: GLY 41:N,CA,C,0 

:VAL 42:N,CA,C,0,CB,CG1,CG2 

: TYR 57 :N, CA, C , 0, CB, CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 

:ASP 58 :N,CA,C,0,CB,CG,0D1,0D2 

: PHE 59 : N, CA, C , O , CB , CG, CD1 , CD2 , CE1 , CE2 , CZ 

:ILE 60 :N,CA,C,0,CB,CG1,CG2,CD1 

:PR0 66 :N,CA, CD, C,0,CB,CG 

: MET 67:N,CA,C,0,CB,CG,SD,CE 
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PD4 9 8 F I NALMODEL : AS P 6 8 : N , CA , C , O , CB , CG , OD 1 , OD2 
PD4 98FINALMODEL : LEU 69 :N, CA, C, O, CB, CG, CD1 , CD2 
PD498 F INALMODEL : ASN 7 0 : N , CA , C , O , CB , CG , OD 1 , ND 2 
PD4 98FINALMODEL :GLY 71:N,CA,C,0 
5 PD4 9 8 F I NALMODEL :HIS 72:N,CA,C,0,CB / CG,ND1,CD2 / CE1,NE2 

PD498FINALMODEL :GLY 73:N,CA,C,0 
PD4 9 8 FINALMODEL : THR 74 : N, CA, C, O, CB , OG1 , CG2 
PD4 98FINALMODEL : HIS 75 : N, CA, C, O, CB , CG , ND1 , CD2 , CE1 , NE2 
PD4 9 8 FINALMODEL : VAL 76 :N, CA, C, O, CB, CGI , CG2 
10 PD4 9 8 FINALMODEL: ALA 77 : N, CA, C, O, CB 

PD4 9 8 FINALMODEL : GLY 78:N,CA,C,0 
PD4 9 8 FINALMODEL : THR 79 : N, CA, C, O, CB, OG1 , CG2 
PD4 9 8 F I NALMODEL : VAL 8 0 : N , CA , C , O , CB , CG 1 , CG2 





Pn/1 QQ t?TXTAT MHriPT . T T7TT 

JrJJ4 yor XJNAXj1 v JvJJJEj1j . IjEiU 




. XT 

: jm 


pa r* 
v_A , U j 


r\ 
V t 


pd pm pno 

Ld , LVj , LJJX , LJJ^s 




•fjjfi yor xiNAXu v ivjjjJiij : AXiA 


i ni 

1U1 


. XT 

: jm 


PA P 


r\ 
U t 


Ld 




JrJJ4 yor X JMAJjJvjujJej Jb : V AXj 


i no 


. XT 

: jn 


pa P 


r\ 
V t 


PT3 PP1 PPO 
Lr5 , LvjX , LLj^ 




trusty or 1 JM ALimUUciLi : AKLj 


i n o 


* "NT 

: JM 


PA P 
LA, L , 


U , 


PD PP PFl XTT? P«7 XTTJ1 XTTJO 

Lo , LLj, LJJ , JMJi , LZ» , JMrll , JMrl^s 




DH/l Q Q T7T\T7\ T MHTlTTT . T7AT 

JrJJ4 yor x JMAijJnUJJJiXi : val 


XU4 


: JM i 


PA P 


LJ / 


PD Pr^T PPO 
Ld , LL?X , LLj^S 




DTlA Q Q ITTXTAT MPnTTT . T T7TT 

JrJJ4 yor XNAXiMvjJJEjJb : JjEjU 




. XT 

: JN 


PA P 
r LA , L , 


r\ 


PD PP PT\T PFvO 
Lb , LVjr , LUX , LJJZ 


o n 
z. u 


Jr\U4 y o r X JNAXjMUJJJiXi : Ab F 


XUo 


. XT 

: JM 


PA n 
LA , L , 


r\ 
V i 


pd rr nm nm 
Ld , Lvj , UJJX , UUZ 




Driil Q Q CTXTAT MPVn'E'T .ATA 

JrJJ4 y or XlMAXjJ v JUJJrjXj : A 1 iA 


1U / 


. XT 

: JM 


PA P 

LA, L , 


r\ 
V t 


PD 

L£5 




DTJA Qfl TTTXTAT MP/HTTT . a CM 
JrJJ4yor ±1MAXjJ v JvJJJI1iXi : AoJM 


i n q 


. XT 

: jm , 


Pa P 
LA, L , 


L' t 


PD PP PFll XTT^O 
Ld , Lvj , UJJX , IMJJZ 




Jr JJ4 yor x jmajjMUJJei j_i : l*Jj x 


i n q 

jlu y 


. XT 

: JM 


CA,C, 


0 






DPiA Q Q T7TXTAT MrVHIn'T . CT7D 

f uft yor J. jmaxjI v jujj h ij : oJiK 




. XT 

: jm 


CA,C, 


o, 


CB,0G 


9 ^ 
Z D 


JrJJ4 yor XJMAXiJ v JUJJrjjj : oHK 


lie 
XXO 


. XT 

: JM j 


CA,C, 


o, 


CB,0G 




DT^/l QQ'CTXTAT MPriTTT . TT IT 

JrJJ4yor XJMAXjJviUJJJiXi : XJ_iJi 


lib 


• XT 

: JM i 


CA,C, 


o, 


CB,CG1,CG2 # CD1 




DPiA O Q T7TXTAT MP/nTT . PT V 

Jr JJ4 yor ± JMAljJnUJJJiXi : IjXi x 


1 1 Q 

xxy 


. XT 

: JM 


CA,C ( 


0 






FJJ4 yor X J>JAXjJYJUJJr.jXi : AbJM 


TOO 


. XT 

: JM t 


CA,C, 


o, 


CB,CG,0D1,ND2 




JrJJ4yor XJNAXjIYIUJJeiXi : XiJiU 


133 


:N, 


CA,C, 


o, 


CB,CG,CD1,CD2 


o n 


DF\/1 Q O T? T "NT 7\ T 1V>TP,TIT?T . CCD 

FU4yor ±NA±jM\juh*L : bbK 


134 


:N, 


CA,C, 


o, 


CB,0G 




f)T\ A OODTXTAT MAr\UT - T "CTT 

FJJ4 y or INALMUJJrjL : LEU 


135 


:N, 


CA,C, 


o, 


CB,CG,CD1,CD2 




PD4yor XNALMODhL : GLY 


136 


:N, 


CA,C, 


0 






FJJ4yor XNALMUDIiL : ALA 


160 


:N, 


CA,C, 


o, 


CB 




FJJ4yor XNAljMUJJJiXi : ALA 


161 


:N, 


CA,C, 


o, 


CB 


o a 


nn A QOT7TKTAT MPTlCT . 7\T 7\ 

FJJ4 yor X JNAXjJYlUJJiiiXi : AXiA 


162 


:N, 


CA,C, 


o, 


CB 




T)T>i A Q OCT MAT lV>fPT^'CT . PT V 

FJJ4 y or XJMAXjMUJJiliij : LLY 


163 


:N, 


CA,C, 


0 






FJJ4 yor INALJYIOJjbL : AbN 


164 


:N, 


CA,C, 


o, 


CB,CG,0D1,ND2 




FD4yor INALMODhiL : VAL 


182 


:N, 


CA,C, 


o, 


CB,CG1,CG2 




VDA QP T7TXTAT MPiT"iT?T • PT V 
rMJ4 yor XlMAXuYlUJJJilj : KjLi X 


183 


:N, 


CA,C, 


0 




40 


PD4 9 8 FINALMODEL : ALA 


184 


:N, 


CA,C, 


o, 


CB 




PD4 9 8 F INALMODEL : PHE 


194 


:N, 


CA,C, 


o, 


CB, CG, CD1 , CD2 , CE1 , CE2 , 




PD4 98 FINALMODEL : PRO 


206 


:N, 


CA,CD,C,0,CB,CG 




PD4 9 8 FINALMODEL : GLY 


207 


:N, 


CA,C, 


0 






PD4 98FINALMODEL: ILE 


210 


:N, 


CA,C, 


o, 


CB,CG1,CG2,CD1 


45 


PD498FINALMODEL : SER 


212 


:N, 


CA,C, 


o, 


CB,0G 




PD4 9 8 FINALMODEL : THR 


213 


:N, 


CA,C, 


o, 


CB,0G1,CG2 




PD4 9 8 FINALMODEL : VAL 


214 


:N, 


CA,C, 


o, 


CB,CG1,CG2 




PD4 9 8 FINALMODEL: PRO 


215:N, 


CA,CD,C 


:,o,cb,cg 




PD4 9 8 FINALMODEL : MET 


222 


:N, 


CA,C, 


o, 


CB,CG,SD,CE 


50 


PD4 98FINALMODEL : SER 


223 


:N, 


CA,C, 


o, 


CB,0G 




PD4 98 FINALMODEL : GLY 


224 


:N, 


CA,C, 


0 






PD4 9 8 F INALMODEL : THR 


225 


:N, 


CA,C, 


o, 


CB,0G1,CG2 




PD4 9 8 FINALMODEL : SER 


226 


:N, 


CA,C, 


o, 


CB,0G 




PD4 9 8 FINALMODEL : MET 


227 


:N, 


CA,C, 


o, 


CB,CG,SD,CE 


55 


PD4 9 8 FINALMODEL : ALA 


228 


:N, 


CA,C, 


o, 


CB 




PD4 9 8 FINALMODEL : SER 


229 


:N, 


CA,C, 


o, 


CB, 0G 




PD4 9 8 FINALMODEL: PRO 


230 


:N, 


CA,CD,C,0,CB,CG 




PD4 9 8 FINALMODEL : HIS 


231 


:N, 


CA,C, 


o, 


CB , CG, ND1 , CD2 , CE1 , NE2 
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Subset RESTx: 

restxmole . list 
Subset RESTX: 

NEWMODEL:233-234 
5 restxatom. list 

Subset RESTX: 

NEWMODEL : ALA 233 :N, CA, C, 0, CB 
NEWMODEL : GLY 234:N,CA,C,0 

10 Example 4 

Suitable substitutions in the Arthromyces ramosus peroxidase for 
addition of carboxylic acid attachment groups (-COOH) 

Suitable locations for addition of carboxylic attachment groups 
(aspartatic acids and glutamic acids) in a non-hydrolytic enzyme, 
15 Arthromyces ramosus peroxidase were found as follows. 

The 3D structure of this oxido-reductase is available in the 
Brookhaven Databank as larp.pdb. This A. ramosus peroxidase contains 
344 amino acid residues. The first eight residues are not visible in 
the X-ray structure: QGPGGGGG, and N143 is glycosylated. 
2 0 The procedure described in Example 1 was followed. 

The amino acid sequence of Arthromyces ramosus Peroxidase 
(E.C.I. 11. 1.7) is shown in SEQ ID NO: 4. 

The commands performed in Insight (BIOSYM) are shown in the 
command files makeDEzone .bcl and makeDEzone2 .bcl below. The C- 

2 5 terminal residue is P344, the ACTSITE is defined as the heme group and 

the two histidines coordinating it (H56 & H184) . 
Conservative substitutions: 
makeDEzone. bcl 
Delete Subset * 

3 0 Color Molecule Atoms * Specified Specification 255,0,255 

Zone Subset ASP :asp:od* Static monomer /residue 10 Color_Subset 
255,255, 0 

Zone Subset GLU :glu:oe* Static monomer /residue 10 Color_Subset 
255,255,0 

35 #NOTE: editnextline C-terminal residue number according to the protein 
Zone Subset CTERM :344:0 Static monomer /residue 10 Color_Subset 
255,255, 0 

#NOTE: editnextline ACTSITE residues according to the protein 
Zone Subset ACTSITE :HEM,56,184 Static monomer /residue 8 Color_Subset 
40 255,255,0 

Combine Subset ALLZONE Union ASP GLU 
Combine Subset ALLZONE Union ALLZONE CTERM 
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Combine Subset ALLZONE Union ALLZONE ACTSITE 
#NOTE: editnextline object name according to the protein 
Combine Subset REST Difference ARP ALLZONE 
List Subset REST Atom Output_File restatom. list 
5 List Subset REST monomer/residue Output_File restmole . list 
Color Molecule Atoms ACTSITE Specified Specification 255,0,0 
List Subset ACTSITE Atom Output_File actsiteatom. list 
List Subset ACTSITE monomer/ residue Output_File actsitemole . list 
# 

10 Zone Subset REST5A REST Static Monomer/Residue 5 -Color_Subset 
Combine Subset SUB5A Difference REST5A ACTSITE 
Combine Subset SUB5B Difference SUB 5 A REST 

Color Molecule Atoms SUB5B Specified Specification 255,255,255 
List Subset SUB5B Atom Output_File subSbatom. list 
15 List Subset SUB5B monomer /residue Output_File sub5bmole . list 

#Now identify sites for asn->asp & gln->glu substitutions and ... 
#continue with makezone2 .bcl . 

#Use grep command to identify asn/gln in restatom. list ... 
#sub5batom. list & accsiteatom. list 

20 

Comments : 

The subset REST contains Gln70, and SUB5B contains Gln34, Asnl28, 
Asn303 all of which are solvent exposed. The substitutions Q34E or 
Q34D, Q70E or Q70D, N128D or N128E and N303D or N303E are identified 
25 in A. ramosus peroxidase as sites for mutagenesis. Residues are 
substituted below and further analysis done: 

Non- conservative substitutions: 
makeDEzone2 .bcl 

30 #sourcefile makezone2 .bcl Claus von der Osten 961128 
# 

#having scanned lists (grep gln/asn command) and identified sites for 

#asn->asp & gln->glu substitutions 
35 #NOTE: editnextline object name according to protein 
Copy Object -To_Clipboard -Displace ARP newmodel 
Biopolymer 

#NOTE: editnextline object name according to protein 
Blank Object On ARP 
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#NOTE: editnextlines with asn->asp & gln->glu positions 
Replace Residue newmodel:34 glu L 
Replace Residue newmodel:70 glu L 
Replace Residue newmodel:128 asp L 
5 Replace Residue newmodel : 3 03 asp L 
# 

#Now repeat analysis done prior to asn->asp & gln->glu, . . . 
#now including introduced asp & glu 

Color Molecule Atoms newmodel Specified Specification 255,0,255 
10 Zone Subset ASPx newmodel : asp : od* Static monomer/residue 10 
Color_Subset 255,255,0 

Zone Subset GLUx newmodel : glu :oe* Static monomer/ residue 10 
Color_Subset 255,255,0 

#NOTE: editnextline C-terminal residue number according to the protein 
15 Zone Subset CTERMx newmodel : 344 :0 Static monomer /residue 10 
Color_Subset 255,255,0 

#NOTE: editnextline ACTSITEx residues according to the protein 

Zone Subset ACTSITEx newmodel : HEM, 56 , 184 Static monomer /residue 8 

Color_Subset 255,255,0 

2 0 Combine Subset ALLZONEx Union ASPx GLUx 

Combine Subset ALLZONEx Union ALLZONEx CTERMx 
Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 
Combine Subset RESTx Difference newmodel ALLZONEx 
List Subset RESTx Atom Output_File restxatom. list 
25 List Subset RESTx monomer/ residue Output_File restxmole . list 
# 

Color Molecule Atoms ACTSITEx Specified Specification 255,0,0 
List Subset ACTSITEx Atom Output_File actsitexatom. list 
List Subset ACTSITEx monomer/ residue Output_File actsitexmole . list 
30 # 

#read restxatom. list or restxmole . list to identify sites for 
(not__gluasp) ->gluasp . . . 
#subst. if needed 

3 5 Comments: 

The subset RESTx contains only four residues: S9, S334, G335 and 
P336, all of which are >5% solvent exposed. The mutations S9D, S9E, 
S334D, S334E, G335D, G335E, P336D and P336E are proposed in A. ramosus 
peroxidase. Acidic residues within the subset ACTSITE are: E44, D57, 



D77, E87, E176, D179, E190 # D202, D209, D246 and the N-terminal 
carboxylic acid on P344 . Of these only E44, D77, E176, D179, E190, 
D209, D246 and the N-terminal carboxylic acid on P344 are solvent 
exposed. Suitable sites for mutations are E44Q, D77N, E176Q, D179N, 
5 E190Q, D209N and D246N. D246N and D246E are risky mutations due to 
D246's importance for binding of heme. 

The N-terminal 8 residues were not included in the calculations 
above, as they do not appear in the structure. None of these 8 
residues, QGPGGGG, contain carboxylic groups. The following variants 
10 are proposed as possible mutations to enable attachment to this 
region: Q1E, Q1D, G2E, G2D, P3E, P3D, G4E, G4D, G5E, G5D, G6E, G6D, 
G7E, G7D, G8E, G8D. 

Relevant data for Example 4 : 

Solvent accessibility data for A. ramosus peroxidase (Note: as 
15 the first eight residues are missing in the X-ray structure, the 
residue numbers printed in the accessibility list below are 8 lower 
than those used elsewhere for residue numbering. 
# ARP Thu Jan 30 15:39:05 MET 1997 





# residue 


area 


20 


SER 1 


143 .698257 




VAL 2 


54.879990 




THR 3 


86.932701 




CYS 4 


8.303715 




PRO 5 


126.854782 


25 


GLY 6 


53 .771488 




GLY 7 


48.137802 




GLN 8 


62 .288475 




SER 9 


79.932549 




THR 10 


16.299215 


30 


SER 11 


81.928642 




ASN 12 


51.432678 




SER 13 


81.993019 




GLN 14 


92 .344009 




CYS 15 


0.000000 


35 


CYS 16 


32 .317432 




VAL 17 


54.067810 




TRP 18 


6.451035 




PHE 19 


25.852070 




ASP 2 0 


79.033997 


40 


VAL 21 


0.268693 




LEU 22 


22.032858 




ASP 23 


90 .111404 




ASP 24 


43 .993240 




LEU 25 


1.074774 


45 


GLN 2 6 


25.589321 




THR 27 


82.698059 




ASN 2 8 


96.600883 




PHE 2 9 


32 .375275 




TYR 3 0 


5.898365 


50 


GLN 31 


103 .380585 
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GLY_32 
SER_33 
LYS__34 
CYS_35 
5 GLU_36 
SER_37 
PRO_3 8 
VAL_3 9 
ARG_40 

10 LYS_41 
ILE_42 
LEU_43 
ARG_44 
ILE_45 

15 VAL_46 
PHE_47 
HIS_48 
ASP_49 
ALA_50 

20 ILE_51 
GLY_52 
PHE_53 
SER_54 
PRO_55 

25 ALA_56 
LEU_57 
THR_58 
ALA__59 
ALA_60 

30 GLY_61 
GLN_62 
PHE_63 
GLY_64 
GLY_65 

35 GLY_66 
GLY_67 
ALA_68 
ASP_69 
GLY_70 

40 SERJ71 
ILEJ72 
ILE_73 
ALA_74 
HISJ75 

4 5 SER_76 
ASNJ77 
ILE_78 
GLU_79 
LEU_80 

50 ALA_81 
PHE_82 
PRO_83 
ALA_84 
ASN_85 

55 GLY_86 
GLY_87 
LEU_88 
THR 89 



40.042034 

46.789322 

87.161873 

12.827215 

51.582657 

16.378180 

33.560043 

6.448641 

7.068311 

15.291286 

1.612160 

1.880854 

16.906845 

0.000000 

2 .312647 

2.955627 

20 .392527 

4.238116 

0 .510757 

1.576962 
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Subset REST: 

restmole . list 

Subset REST: 

ARP: 9, 69-70,125, 127, 133,2 99-301,334-336 

25 restatom. list 

Subset REST: 

ARP : SER 9 :N # CA, C # O, CB , OG 

ARP : GLY 69:N,CA,C,0 

ARP : GLN 70 : N, CA, C, O, CB , CG, CD, OE1 , NE2 
30 ARP : GLY 125:N,CA,C,0 

ARP : SER 127 : N, CA, C, 0, CB , OG 
ARP:PRO 133 :N, CA, CD, C, O, CB, CG 
ARP : SER 299 :N, CA, C, 0, CB, OG 
ARP : ALA 300 :N, CA, C, 0, CB 

3 5 ARP : VAL 301 :N, CA, C, 0, CB , CGI , CG2 

ARP : SER 334 :N, CA, C, O, CB, OG 

ARP : GLY 335:N,CA,C,0 

ARP:PRO 336:N,CA,CD,C,0,CB,CG 
Subset SUB5B: 
40 subSbmole . list 

Subset SUB5B: 

ARP: 10 -11, 34, 3 8, 65-68, 71-72, 120-121, 123-124, 128-132, 134,270,274, 
ARP: 2 97-298, 3 02-303, 3 11 -3 12, 332 -333, 337-338 
subSbatom. list 

4 5 Subset SUB5B: 

ARP: VAL 10 : N, CA, C , O, CB , CGI , CG2 
ARP : THR 11 : N, CA, C, O, CB , OG1 , CG2 
ARP : GLN 34 : N, CA, C , O, CB , CG, CD, OE1 , NE2 
ARP : TYR 3 8 : N , CA , C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 
50 ARP : LEU 65 :N, CA, C, 0, CB, CG, CD1 , CD2 

ARP: THR 66 : N, CA, C, 0, CB , OG1 , CG2 
ARP : ALA 67 :N, CA, C, O, CB 
ARP : ALA 68 : N, CA, C , O, CB 

ARP : PHE 71 :N, CA, C, O, CB, CG, CD1 , CD2 , CE1 , CE2 , CZ 
55 ARP : GLY 72:N,CA,C,0 

ARP: PHE 120 :N, CA, C, O, CB, CG, CD1 , CD2 , CE1 , CE2 , CZ 
ARP : ALA 121 : N, CA, C, 0, CB 
ARP : ALA 123 : N, CA, C, 0, CB 
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ARPrVAL 124 : N, CA, C, O, CB , CGI , CG2 
ARP : ASN 128 :N, CA, C, O, CB, CG, OD1 , ND2 
ARPrCYS 129:N,CA,C,0,CB,SG 
ARP : PRO 130:N,CA,CD,C,O,CB,CG 
5 ARP : GLY 131 :N, CA, C, O 

ARP : SER 132 :N, CA, C, O, CB, OG 

ARP : ARG 134 :N, CA, C, O, CB, CG, CD, NE, CZ, NH1 , NH2 
ARP : GLY 270:N,CA,C,O 

ARP: ARG 274 :N, CA, C, O, CB, CG, CD, NE, CZ , NH1 , NH2 
10 ARP : ILE 297 :N, CA, C, O, CB, CGI , CG2 , CD1 

ARP:PRO 298:N,CA,CD,C,0,CB,CG 

ARP : SER 3 02 : N, CA, C , O , CB , OG 

ARP : ASN 303 :N, CA, C, O, CB, CG, OD1 ,ND2 

ARP : GLY 311:N,CA,C,0 
15 ARP : GLY 312:N,CA,C,0 

ARP : THR 332 :N, CA, C, O, CB, OG1 , CG2 

ARP : ALA 333 :N, CA, C, O, CB 

ARP: LEU 337 :N, CA, C, O, CB, CG, CD1 , CD2 

ARP-.PRO 338:N,CA,CD,C,0,CB,CG 
2 0 Subset ACTSITE: 

actsitemole . list 
Subset ACTSITE: 

ARP: 44-61, 75-77, 79-80, 87-88, 90-96, 99, 118, 122,126, 135,148-149,152- 

158, 

25 ARP: 163 -164 ,167, 176 -194, 197-205,207-20 9, 211-213,216,230-231,241, 

ARP : 243 -246, 249, 259, 273, 2 77, 280, 343 -347H 
actsiteatom. list 
Subset ACTSITE: 

ARP : GLU 44 :N, CA, C, O , CB, CG, CD, OE1 , OE2 
30 ARP : SER 45 :N, CA, C, O, CB, OG 

ARP:PRO 46:N,CA,CD,C,0,CB,CG 

ARP : VAL 47 : N, CA, C, O, CB, CGI , CG2 

ARP: ARG 48 : N, CA, C, O, CB, CG, CD, NE , CZ , NH1 , NH2 

ARP : L YS 4 9 : N , CA , C , O , CB , CG , CD , CE , NZ 
35 ARP : ILE 50 : N, CA, C, O, CB , CGI , CG2 , CD1 

ARP : LEU 51 :N, CA, C, O, CB, CG, CD1 , CD2 

ARP: ARG 52 :N, CA, C, O, CB, CG, CD,NE, CZ,NH1 ,NH2 

ARP : ILE 53 :N, CA, C, O, CB, CGI , CG2 , CD1 

ARP : VAL 54 :N, CA, C, O, CB, CGI , CG2 
4 0 ARP : PHE 55 :N, CA, C, O, CB, CG, CD1 , CD2 , CE1 , CE2 , CZ 

ARP: HIS 56:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 

ARP:ASP 57:N,CA,C,0,CB,CG,ODl,OD2 

ARP : ALA 58 : N, CA, C, 0, CB 

ARP : ILE 5 9 : N , CA , C , 0 , CB , CGI , CG2 , CD1 
45 ARP : GLY 60:N,CA,C,O 

ARP: PHE 61:N,CA,C,0, CB, CG, CD1 , CD2 , CE1 , CE2 , CZ 

ARP : GLY 75:N,CA,C,0 

ARP : ALA 76 : N, CA, C, O, CB 

ARP:ASP 77:N,CA,C,0,CB,CG,ODl,OD2 
50 ARP : SER 79 :N, CA, C, 0, CB, OG 

ARP : ILE 80:N,CA,C,O,CB,CGl,CG2,CDl 

ARP: GLU 87 : N, CA, C, 0, CB, CG, CD, OE1 , OE2 

ARP: LEU 88 :N, CA, C, O, CB, CG, CD1 , CD2 

ARP : PHE 9 0 : N , CA , C , O , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ 
55 ARP:PRO 91:N,CA,CD,C,0,CB,CG 

ARP : ALA 92 : N, CA, C, 0, CB 

ARP : ASN 93 : N, CA, C, O, CB, CG, OD1 , ND2 

ARP : GLY 94:N,CA,C,0 
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ARP:GLY 95:N,CA,C,0 

ARP : LEU 96 : N, CA, C, O, CB , CG, CD1 , CD2 

ARP : THR 99 :N, CA, C, 0, CB , 0G1 , CG2 
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APP • VAT. 


X / / • IN | 


PA 


n 

\~ i 


n pr ppi pn9 

r \J , LD , Lul , LbZ 




APP • VAT, 


1 7R • TJ 

X / O . IN i 


PA 


c 


f 0,CB,CGl,CG2 




APP • A^P 


1 7Q • TJ 

X / J . IN f 


PA 


c 


0,CB,CG,0D1,0D2 




APP • T.PTT 


i ft n • nt 

lOU . IN i 


PA 


c 


0,CB,CG,CD1,CD2 


Z J 


APP • T.PTT 


1 ft 1 • NF 

X O X . IN | 


PA 


c 


0,CB,CG,CD1 # CD2 




APP • AT, A 


1 ft 9 • TJ 

X O Z . IN / 


PA 


c 


f 0,CB 




APP • AT, A 


1 R7 • TJ 

IOj . IN f 


PA 


c 


r 0,CB 






1 ft4 - TJ 

lOt .IN t 


PA 


c 


0 , CB , CG , ND1 , CD2 , CE 1 , NE2 




APP • QT?D 


1 OCT . XT 
lO J . IN i 


PA 


c 


O/CB.OG 


9 n 


AP P • T TTTT 


lOD . I N , 


PA 


c 


0^6,00,001^02 




APP • AT.A 


1 ft 7 • NT 
X O / . IN / 


PA 


c 


0,CB 






1 ft ft • NT 

loo : in , 


PA 


c 


f 0,CB,0G 




APP • PT.NT 


1 ft Q • NT 
IOJ . IM f 


PA 


c 


f 0,CB,CG,CD,0El,NE2 




AP P • PT.TT 


i q n . nt 

Xi?U . IN , 


PA 


c 


0 / CB,CG,CD,0El,0E2 


9 ^ 


AP P • PT.V 


1 Q 1 • NT 
X y X • In / 


PA 


c 


0 




APP • T.T7TT 


1 Q9 • NT 
x z . in | 


PA 


c, 


0,CB,CG,CD1,CD2 




APP • AQNT 


1 Q 7 • NT 


PA 


c, 


0,CB,CG,0D1,ND2 




App . CT?P 


1 .NT 

x y *± : in j 


PA 


c 


r 0,CB,0G 




APP • PT4TT 


1 Q 7 • NT 
x y / . in # 


PA 


c 


r 0 , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ 


4. n 

*± U 


AP P • APn 


1 Q ft - NT 
l yo . in , 


PA 


c 


0,CB,CG,CD,NE / CZ,NH1,NH2 




APP • QT?P 


1 Q Q • NT 

±yy . in / 


PA 


c, 


0,CB,0G 




APD • DPH 


9 n n - nt 

Z U U . IN , 


PA 


CD,C,0,CB,CG 




APD . T T?TT 


o n i .xj 
z u x : IM , 


PA 




0,CB,CG,CD1,CD2 




AP P . AGP 


Onn .xt 
z UZ : in , 




C 


0,CB,CG,0D1,0D2 




APP . CT7P 


9 n *2 .NT 

z u J : N 7 


PA 


c 


0,CB,0G 




APD. TUD 


Oftyl .KT 


n A 




0,CB,0G1,CG2 




APP . DPO 


9 n cr « xt 

z u o : in / 


PA 
LA/ 


CD,C,0,CB,CG 




APP . T7AT 


o n 7 - XT 

z u / : in / 


r*A 




0,CB,CG1,CG2 




APP • PRF 


90ft • TJ 

Zj \J O . IN y 


PA 


c, 


0 , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ 


50 


ARP: ASP 


209:N, 


CA, 


C 


0,CB,CG,0D1,0D2 




ARP : GLN 


211:N, 


CA, 


c, 


0,CB,CG,CD,0E1,NE2 




ARP : PHE 


212:N, 


CA, 


c, 


0 , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ 




ARP : TYR 


213 :N, 


CA 7 


c, 


0, CB , CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 




ARP : THR 


216^, 


CA, 


c, 


0,CB,0G1,CG2 


55 


ARP : PHE 


230:N, 


CA, 


c, 


0, CB , CG, CD1 , CD2 , CE1 , CE2 , CZ 




ARP: ALA 


231:N, 


CA, 


c, 


0,CB 




ARP : PHE 


241:N # 


CA, 




0, CB , CG, CD1 , CD2 , CE1 , CE2 , CZ 




ARP: MET 


243 :N, 


CA, 




0,CB,CG,SD,CE 
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ARP : ARG 244 : N, CA # C , O, CB , CG, CD , NE , CZ , NH1 , NH2 
ARP:SER 245 :N,CA,C,0,CB,OG 
ARP : ASP 2 4 6 : N , CA ,0,0, CB , CG , 0D1 , 0D2 
ARP : LEU 2 4 9 : N , CA , C , O , CB , CG , CD 1 , CD2 
5 ARP : TRP 25 9 : N , CA, C , O , CB , CG, CD1 , CD 2 , NE1 , CE2 , CE3 , CZ2 , CZ3 , CH2 

ARP : TYR 273 :N, CA, C, O, CB , CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 
ARP : MET 277 :N, CA, C, O, CB, CG, SD, CE 
ARP : MET 280 :N, CA, C, O, CB, CG, SD, CE 
ARP : ALA 343 :N, CA, C, O, CB 
10 ARP : PRO 344:N / CA,CD,C,0,OXT,CB,CG 

ARP : HEM 3 4 5H : FE , NA , NB , NC , ND , CHA , CHB , CHC , CHD , CI A , C2 A , C3A, C4 A , CMA , 
CAA, CBA, CGA 

ARP : HEM 34 5H : 01A, 02A, C1B , C2B , C3B , C4B , CMB , CAB , CBB , C1C , C2C, C3C, C4C , 
CMC,CAC,CBC 

1 5 ARP : HEM 3 4 5H : C1D , C2D, C3D, C4D , CMD , CAD , CBD , CGD , 01D , 02D 

ARP : CA 346H:CA 
ARP:CA 347H:CA 
Subset RESTx: 

restxmole .list 
20 Subset RESTX 

NEWMODEL :9, 334-336 
restxatom. list 
Subset RESTX: 

NEWMODEL : SER 9 :N, CA, C, O, CB, OG 
25 NEWMODEL : SER 334 :N, CA, C, 0, CB, OG 

NEWMODEL :GLY 335:N,CA,C,0 
NEWMODEL: PRO 33 6 : N, CA, CD, C, 0, CB , CG 

Example 5 

3 0 Activation of mPEG 15,000 with N-succinimidyl carbonate 

mPEG 15,000 was suspended in toluene (4 ml/g of mPEG) 20% was 
distilled off at normal pressure to dry the reactants azeotropically . 
Dichloromethane (dry 1 ml/g mPEG) was added when the solution was cooled 
to 30°C and phosgene in toluene (1.93 M 5 mole/mole mPEG) was added and 
35 mixture stirred at room temperature overnight. The mixture was 
evaporated to dryness and the desired product was obtained as waxy 
lumps . 

After evaporation dichloromethane and toluene (1:2, dry 3 ml/g 
mPEG) was added to re-dissolve the white solid. N-Hydroxy succinimide (2 

4 0 mole/mole mPEG) was added as a solid and then triethylamine (1.1 

mole/mole mPEG) . The mixture was stirred for 3 hours, initially unclear, 
then clear and ending with a small precipitate. The mixture was 
evaporated to dryness and recrystallized from ethyl acetate (10 ml) with 
warm filtration to remove salts and insoluble traces. The blank liquid 
4 5 was left for slow cooling at ambient temperature for 16 hours and then 
in the refrigerator overnight. The white precipitate was filtered and 
washed with a little cold ethyl acetate and dried to yield 98 % (w/w) . 
NMR Indicating 80 - 90% activation and 5 o/oo (w/w) HNEt 3 Cl . *H-NMR for 
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mPEG 15,000 (CDC1 3 ) d 1.42 t (1= 4.8 CH 3 i HNEt 3 Cl) , 2.84 s (1= 3.7 
succinimide) , 3.10 dq (1= 3.4 CH 2 i HNEt 3 Cl) , 3.38 s (1= 2.7 CH 3 i OMe) , 
3.40* dd (I = 4.5 o/oo, 13 C satellite), 3 . 64 bs (I = 1364 main peak), 
3.89* dd (I = 4.8 o/oo , 13 C satellite), 4.47 dd (I = 1.8, CH 2 in PEG). 
5 No change was seen after storage in a desiccator at 22°C for 4 months. 

Example 6 

Activation of mPEG 5,000 with N-succinimidyl carbonate 

Activation of mPEG 5,000 with N-succinimidyl carbonate was 
10 performed as described in Example 5. 

Example 7 

Construction and expression of PD498 variants : 

PD498 site-directed variants were constructed using the "maxi- 
15 oligonucleotide-PCR" method described by Sarkar et al . , 1990, 
BioTechniques, 8, 404-407. 

The template plasmid was shuttle vector pPD498 or an analogue of 
this containing a variant of the PD498 protease gene. 

The following PD498 variants were constructed, expressed and 
2 0 purified. 



A: 


R28K 


B: 


R62K 


C: 


R169K 


D: 


R28K+R62K 


25 E: 


R28K+R169K 


F: 


R62K+R169K 


G: 


R28K+R69K+R169K 



Construction of variants 
3 0 For introduction of the R28K substitution a synthetic 

oligonucleotide having the sequence: GGG ATG TAA CCA AGG GAA GCA GCA CTC 

AAA CG (SEQ ID NO: 7) was used. 

A PCR fragment of 769 bp was ligated into the pPD498 plasmid 

prepared by Bst E II and Bgl II digestion. Positive variants were 
3 5 recognized by Styl digestion and verified by DNA sequencing of the total 

769 bp insert. 

For introduction of the R62K substitution a synthetic 
oligonucleotide having the sequence: 

CGA CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO : 8) was used. 
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A PCR fragment of 769 bp was ligated into the pPD498 plasmid 
prepared by Bst E II and Bgl II digestion. Positive variants were 
recognized by Clal digestion and verified by DNA sequencing of the total 
769 bp insert. 

5 For introduction of the R169K substitution a synthetic 

oligonucleotide having the sequence: 

CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO: 9) was used. 

A PCR fragment of 769 bp was ligated into the pPD498 plasmid 
prepared by Bst E II and Bgl II digestion. Positive variants were 

10 recognized by the absence of an Rsa I restriction site and verified by 
DNA sequencing of the total 769 bp insert. 

For simultaneous introduction of the R28K and the R62K 
substitutions, synthetic oligonucleotides having the sequence GGG ATG 
TAA CCA AGG GAA GCA GCA CTC AAA CG (SEQ ID NO: 7) and the sequence CGA 

15 CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO: 8) were used simultaneously. 
A PCR fragment of 769 bp was ligated into the pPD498 plasmid prepared by 
Bst E II and Bgl II digestion. Positive variants were recognized by Styl 
and Clal digestion and verified by DNA sequencing of the total 769 bp 
insert . 

2 0 For simultaneous introduction of the R28K and the R169K 

substitutions, synthetic oligonucleotides having the sequence GGG ATG 
TAA CCA AGG GAA GCA GCA CTC AAA CG (SEQ ID NO: 7) and the sequence CAA 
TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO: 9) were used simultaneously. 
A PCR fragment of 769 bp was ligated into the pPD498 plasmid prepared by 
25 Bst E II and Bgl II digestion. Positive variants were recognized by Styl 
digestion and absence of an Rsa I site. The variant was verified by DNA 
sequencing of the total 769 bp insert. 

For simultaneous ntroduction of the R62K and the R169K 
substitutions, synthetic oligonucleotides having the sequence CGA CTT 

3 0 TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO: 8) and the sequence CAA TGT ATC 

CAA AAC GTT CCA ACC AGC (SEQ ID NO: 9) were used simultaneously. A PCR 
fragment of 769 bp was ligated into the pPD498 plasmid prepared by Bst E 
II and Bgl II digestion. Positive variants were recognized by Clal 
digestion and absence of an Rsa I site. The variant was verified by DNA 
35 sequencing of the total 769 bp insert. 

For simultaneous introduction of the R28K, the R62K and the R169K 
substitutions, synthetic oligonucleotides having the sequence GGG ATG 
TAA CCA AGG GAA GCA GCA CTC AAA CG (SEQ ID NO: 7) , the sequence CGA CTT 
TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO: 8) and the sequence CAA TGT ATC 



84 



I 



CAA AAC GTT CCA ACC AGC (SEQ ID NO: 9) were used simultaneously. A PCR 
fragment of 769 bp was ligated into the pPD498 plasmid prepared by Bst E 
II and Bgl II digestion. Positive variants were recognized by Styl and 
Clal digestion and absence of an Rsa I site. The variant was verified by 
5 DNA sequencing of the total 769 bp insert. 

Fermentation, expression and purification of PD498 variants 

Vectors hosting the above mentioned PD498 variants were purified 
from E. coli cultures and transformed into B. subtilis in which organism 
10 the variants were fermented, expressed and purified as described in the 
"Materials and Methods" section above. 

Example 8 

Conjugation of triple substituted PD498 variant with activated mPEG 
15 5,000 

200 mg of triple substituted PD498 variant (i.e. the 
R28K+R62K+R169K substituted variant) was incubated in 50 mm NaBorate, pH 
10, with 1.8 g of activated mPEG 5,000 with N-succinimidyl carbonate 
(prepared according to Example 2), in a final volume of 20 ml. The 
2 0 reaction was carried out at ambient temperature using magnetic stirring. 
Reaction time was 1 hour. The reaction was stopped by adding DMG buffer 
to a final concentration of 5 mM dimethyl glutarate, 1 mM CaCl 2 and 50 
mM borate, pH 5.0. 

The molecule weight of the obtained derivative was approximately 

2 5 120 kDa, corresponding to about 16 moles of mPEG attached per mole 

enzyme . 

Compared to the parent enzyme, residual activity was close to 100% 
towards peptide substrate (succinyl-Ala-Ala-Pro-Phe-p-Nitroanilide) . 

3 0 Example 9 

Allergenicity trials of PD498 variant-SPEG 5,000 in guinea pigs 

Dunkin Hartley guinea pigs are stimulated with 1.0 microgram 
PD498-SPEG 5,000 and 1.0 microgram modified variant PD498-SPEG 5,000 by 
intratracheal installation . 
3 5 Sera from immunized Dunkin Hartley guinea pigs are tested during 

the trial period in a specific IgGi ELISA (described above) to elucidate 
whether the molecules could activate the immune response system giving 
rise to a specific IgGi response indicating an allergenic response. 
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The IgGx levels of Dunkin Hartley guinea pigs during the trial 
period of 10 weeks are observed. 

Example 10 

5 Suitable substitutions in Humicola lanuginosa lipase for addition of 
amino attachment groups ( -NH 2 ) 

The 3D structure of Humicola lanuginosa lipase (SEQ ID NO: 6) is 
available in Brookhaven Databank as ltib.pdb. The lipase consists of 
269 amino acids. 

10 The procedure described in Example 1 was followed. The sequence 

of H. lanuginosa lipase is shown below in the table listing solvent 
accessibility data for H. lanuginosa lipase. H. lanuginosa residue 
numbering is used (1-269) , and the active site residues (functional 
site) are S146, S201 and H258. The synonym TIB is used for H. 

15 lanuginosa lipase. 

The commands performed in Insight (BIOSYM) are shown in the 
command files makeKzone .bcl and makeKzone2 .bcl below: 



Conservative substitutions: 
20 makeKzone. bcl 

1 Delete Subset * 

2 Color Molecule Atoms * Specified Specification 255,0,255 

3 Zone Subset LYS :lys:NZ Static monomer /re si due 10 Color_Subset 
255,255,0 

25 4 Zone Subset NTERM :1:N Static monomer /residue 10 ColorJSubset 
255,255, 0 

5 #N0TE: editnextline ACTSITE residues according to the protein 

6 Zone Subset ACTSITE : 146, 2 01, 258 Static monomer/ residue 8 
ColorjSubset 255,255,0 

3 0 7 Combine Subset ALLZONE Union LYS NTERM 

8 Combine Subset ALLZONE Union ALLZONE ACTSITE 

9 #NOTE: editnextline object name according to the protein 

10 Combine Subset REST Difference TIB ALLZONE 

11 List Subset REST Atom Output_File restatom. list 

35 12 List Subset REST monomer/residue Output_File restmole . list 

13 Color Molecule Atoms ACTSITE Specified Specification 255,0,0 

14 List Subset ACTSITE Atom Output_File actsiteatom. list 

15 List Subset ACTSITE monomer/ residue Output_File actsitemole . list 

16 # 
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17 Zone Subset REST5A REST Static Monomer/Residue 5 -Color_Subset 

18 Combine Subset SUB5A Difference REST5A ACTSITE 

19 Combine Subset SUB5B Difference SUB5A REST 

20 Color Molecule Atoms SUB5B Specified Specification 255,255,255 

21 List Subset SUB5B Atom Output_File subSbatom. list 

22 List Subset SUB5B monomer /residue Output_File sub5bmole . list 

23 #Now identify sites for lys->arg substitutions and continue with 
makezone2 .bcl 

24 #Use grep command to identify ARG in restatom. list , 
sub5batom. list & accsiteatom. list 

Comments : 

In this case of H. lanuginosa (=TIB) , REST contains the arginines 
Argl33, Argl39, Argl60, Argl79 and Arg 209, and SUB5B contains Argll8 
and R125. 

These residues are all solvent exposed. The substitutions R133K, 
R139K, R160K, R179K, R209K, R118K and R125K are identified in TIB as 
sites for mutagenesis within the scope of this invention. The 
residues are substituted below in section 2, and further analysis 
done. The subset ACTSITE contains no lysines. 

Non- conservative substitutions: 
makeKzone2 .bcl 

1 #sourcefile makezone2 .bcl Claus von der Osten 961128 

2 # 

3 #having scanned lists (grep arg command) and identified sites for 
lys->arg substitutions 

4 #N0TE: editnextline object name according to protein 

5 Copy Object -To_Clipboard -Displace TIB newmodel 

6 Biopolymer 

7 #N0TE: editnextline object name according to protein 

8 Blank Object On TIB 

9 #NOTE: editnext lines with lys->arg positions 

10 Replace Residue newmodel: 118 lys L 

11 Replace Residue newmodel: 125 lys L 

12 Replace Residue newmodel: 133 lys L 

13 Replace Residue newmodel: 139 lys L 

14 Replace Residue newmodel: 160 lys L 

15 Replace Residue newmodel: 179 lys L 
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16 Replace Residue newmodel : 209 lys L 

17 # 

18 #Now repeat analysis done prior to arg->lys / now including 
introduced lysines 

5 19 Color Molecule Atoms newmodel Specified Specification 255,0,255 

20 Zone Subset LYSx newmodel : lys :NZ Static monomer/residue 10 
Color_Subset 255,255,0 

21 Zone Subset NTERMx newmodel :1:N Static monomer/residue 10 
Color_Subset 255,255,0 

10 22 #NOTE: editnextline ACTSITEx residues according to the protein 

23 Zone Subset ACTSITEx newmodel : 146 , 201 , 258 Static monomer/ residue 
8 Color_Subset 255,255,0 

24 Combine Subset ALLZONEx Union LYSx NTERMx 

25 Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 
15 26 Combine Subset RESTx Difference newmodel ALLZONEx 

27 List Subset RESTx Atom Output_File restxatom. list 

28 List Subset RESTx monomer/ residue Output_File restxmole . list 

29 # 

3 0 Color Molecule Atoms ACTSITEx Specified Specification 255,0,0 
2 0 31 List Subset ACTSITEx Atom Output_File actsitexatom. list 

32 List Subset ACTSITEx monomer/ residue Output_File 
actsitexmole . list 

33 # 

34 #read restxatom. list or restxmole . list to identify sites for 
25 (not_arg) ->lys subst. if needed 

Comments : 

Of the residues in RESTx, the following are >5% exposed (see 
lists below): 18, 31-33, 36, 38, 40, 48, 50, 56-62, 64, 78, 88, 91-93, 

30 104-106, 120, 136, 225, 227-229, 250, 262, 268. Of these three are 
cysteines involved in disulfide bridge formation, and consequently for 
structural reasons excluded from the residues to be mutated. The 
following mutations are proposed in H. lanuginosa lipase (TIB) : 
A18K, G31K, T32K, N33K, G38K, A40K, D48K, T50K, E56K, D57K, S58K, 

35 G59K, V60K, G61K, D62K, T64K, L78K, N88K, G91K, N92K, L93K, S105K, 
G106K, V120K, P136K, G225K, L227K, V228K, P229K, P250K, F262K. 
Relevant data for Example 10: 

# TIBNOH20 

# residue area 

40 GLUJL 110.792610 
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ASP 


"ill 


37 . 023655 




GLY 


112 


29.930408 




PHE 


113 


14 .976435 


55 


THR 


114 


10 .430954 




SER 


115 


40.606895 




SER 


116 


13 .462922 




TRP 


117 


10.747735 
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ARG_118 
SERJL19 
VAL_120 
ALA_121 
5 ASP_122 
THR_123 
LEU_124 
ARG_125 
GLN_126 

10 LYS_127 
VAL_128 
GLU_129 
ASP_130 
ALAJL31 

15 VAL_132 
ARG_133 
GLU_134 
HIS_135 
PRO_136 

2 0 ASP_137 

TYR_13 8 
ARG_13 9 
VAL_140 
VAL_141 
25 PHE_142 
THR_143 
GLY_144 
HIS_145 
SER_146 

3 0 LEU_147 

GLY_14 8 
GLY__149 
ALA_150 
LEUJL51 
35 ALA_152 
THR_153 
VAL_154 
ALA_155 
GLY_156 

4 0 ALA_157 

ASP_158 
LEU_159 
ARG_160 
GLY_161 

4 5 ASN_162 
GLY_163 
TYR_164 
ASP_165 
ILE_166 

50 ASP_167 
VAL_168 
PHE__169 
SER_170 
TYR_171 

55 GLY_172 
ALA_173 
PRO__174 
ARG 175 



114.364281 

46.880249 

13 .434669 

18.258261 

110.753098 

69.641922 

17.090784 

73 .929977 

101.320190 

84 .450241 

6.448641 

47.700993 

75.529091 

11.340775 

27 .896025 

153.136490 

132.140594 

54 .553406 

97.386963 

22.653191 

35.392658 

74 .321243 

10.173222 

0.233495 

3.224321 

0.000000 

0.000000 

4.514527 

15.749787 

40.709171 

0.000000 

0 .000000 

0.537387 

22.838938 

0.268693 

18.078798 

7.254722 

0.000000 

0 .000000 

15.140230 

41.645477 

6.144750 

41.939716 

68 .978180 

68.243805 

79. 181274 

36.190247 

103 .068283 

0.000000 

24.326443 

4 .299094 

0.466991 

3 .339332 

0 .000000 

0 .000000 

12.674671 

13.117888 

10.004488 
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VAL_176 
GLY_177 
ASN_178 
ARG_179 
5 ALA_180 
PHE_181 
ALA_182 
GLU__183 
PHE_184 

10 LEU_185 
THR_186 
VAL_187 
GLN_188 
THR_189 

15 GLY_190 
GLY_191 
THR_192 
LEU_193 
TYR_194 

2 0 ARG_195 
ILE_196 
THR_197 
HIS_198 
THR_199 

25 ASN_200 
ASP_2 01 
ILE_2 02 
VAL_2 03 
PRO_204 

30 ARG_205 
LEU_2 06 
PRO_2 07 
PRO_2 08 
ARG_2 09 

35 GLU_210 
PHE_211 
GLY_212 
TYR_213 
SER_214 

40 HIS_215 
SER_216 
SER_217 
PRO_218 
GLU_219 

45 TYR_220 
TRP_221 
ILE__222 
LYS_223 
SER_224 

50 GLY_225 
THR_226 
LEU_227 
VAL_228 
PRO_229 

55 VAL_2 3 0 
THR_231 
ARG__232 
ASN 233 



21.422220 

2.680759 

21.018063 

110.282166 

33 .210381 

4.567788 

3 .897251 

76.354004 

71.225983 

24.985012 

47.023815 

98.244606 

54.152954 

88.660645 

24.792120 

10.726818 

45 .458744 

16 .633211 

34.829491 

29.030851 

1.973557 

3 .493014 

1.532270 

34 .785877 

39.789238 

0 .000000 

31.168434 

29.521076 

3 .515322 

44.882454 

51.051746 

12 .575329 

43 .259636 

113 .700233 

154.628540 

112 .505188 

30.084938 

3 .268936 

12 .471436 

23 .354481 

16 .406200 

14.665598 

17 .240993 

13 .145291 

18.718306 

39.229233 

5 .105175 

120 .739983 

15.407301 

29.306646 

66.806862 

122 .682808 

60.923004 

104 .620377 

23 .398251 

63 .372971 

80.357857 

89.255066 
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ASP 


234 


43 011250 


ile" 


235 


2 . 114349 


VAlf 


236 


45 140491 


LYS~ 


237 


105 651306 


ILE" 


238 


24 671705 


GLU" 


239 


116 891907 


GLY" 


240 


31 965794 


ILE~ 


241 


46 278099 




■£ *X «£ 


28 963699 


AT.A~ 


243 


25 158146 


THR~ 


244 


98 351440 


GLY~ 


245 


43 842186 


GLY~ 


246 


0 700486 


asn" 


247 


3 926274 


ASN~ 


248 


51 047890 


gln" 


249 


66 699188 


pro" 


250 


132 414047 


asn" 


251 


70 213730 


ILE~ 

X -1—1 i-J 


252 


141 498062 

X M X » T J U U U ti 


pro" 


253 


59 089233 


asp" 


£t J *I 


59 010895 


ile" 


255 
—j —j 


63 298943 


pro" 


256 


78 608688 


ALA" 


257 


0 806080 


his" 


258 


3 761708 


leu" 


259 


50 747856 


TRP~ 


260 


35 229710 


TYR~ 


261 


5 .440791 


PHE~ 


262 


36 . 457939 


HT.Y - 

Oil ± 


U J 




leu" 


"264 


109.148178 


ILE 


"265 


2.418241 


GLY 


"266 


17.730062 


thr" 


"267 


68.217873 


CYS 


"268 


15.418195 


LEU 


"269 


165.990997 



Subset REST: 

restmole . list 
Subset REST: 

4 0 TIB :5, 8-9, 13-14, 16, 18-20,31-34,36,38,40,48-50,56-66,68,76-79,88, 

91-93, 

TIB: 100 -107, 116 -117, 119-121, 132-134, 136, 13 9-142, 154-169,177-185, 

TIB: 187, 189-191, 207-212,214-216,225,227-22 9,241-244,250,262,268 
restatom. list 
45 Subset REST: 

TIB:ASP 5:N,CA,C,0,CB,CG,ODl,OD2 

TIB : ASN 8 : N, CA, C , 0, CB , CG , 0D1 , ND2 

TIB :GLN 9 : N, CA, C , 0, CB , CG , CD, 0E1 , NE2 

TIB : PHE 13 : N, CA, C, 0, CB, CG, CD1 , CD 2 , CE1 , CE2 , CZ 
50 TIB : ALA 14 : N, CA, C, 0, CB 

TIB : TYR 16 : N , CA , C , 0 , CB , CG , CD1 , CD 2 , CE1 , CE2 , CZ , OH 

TIB : ALA 18 :N, CA, C, O, CB 

TIB : ALA 19 :N, CA, C, O, CB 

TIB : ALA 20 :N, CA, C, 0, CB 
55 TIB : GLY 31:N,CA,C,0 

TIB : THR 32 :N, CA, C, O, CB, OG1 , CG2 

TIB : ASN 33 :N, CA, C, O, CB, CG, OD1 , ND2 

TIB : ILE 34:N,CA,C,0,CB, CGI , CG2 , CD1 



TIB:CYS 36:N,CA,C,0,CB,SG 

TIB :GLY 38:N,CA,C,0 

TIB : ALA 40 : N, CA, C, 0, CB 

TIB: ASP 48 :N,CA,C, 0,06,00,001,002 
5 TIB : ALA 49 :N, CA, C, 0, CB 

TIB:THR 50:N,CA,C,O,CB,OGl,CG2 

TIBrGLU 56:N,CA,C,0,CB,CG,CD,OEl,OE2 

TIB: ASP 57:N,CA,C,0,CB,CG,ODl,OD2 

TIB : SER 58 : N, CA, C, O, CB, OG 
10 TIB :GLY 59:N,CA,C,0 

TIB : VAL 60 : N, CA, C, O , CB , CGI , CG2 

TIB :GLY 61:N,CA,C,0 

TIB: ASP 62 :N, CA, C, 0, CB, CG, 0D1 , 0D2 

TIB : VAL 63 :N, CA, C, O, CB, CGI , CG2 
15 TIB :THR 64 : N, CA, C, O, CB, 0G1 , CG2 

TIB :GLY 65:N,CA,C,0 

TIB : PHE 66 : N , CA, C, O , CB , CG, CD1 , CD2 , CE1 , CE2 , CZ 

TIB : ALA 68 : N, CA, C, O, CB 

TIB: ILE 76:N,CA,C,0,CB,CG1,CG2,CD1 
20 TIB : VAL 77 : N, CA, C, 0, CB, CGI , CG2 

TIB : LEU 78 : N, CA, C, 0, CB, CG, CD1 , CD2 

TIB : SER 79 :N, CA, C, O, CB, OG 

TIB : ASN 88 :N, CA, C, O, CB, CG, 0D1 , ND2 

TIB : GLY 91:N,CA,C,0 
25 TIB : ASN 92 : N, CA, C , 0 , CB , CG, 0D1 , ND2 

TIB: LEU 93 :N, CA, C, 0, CB, CG, CD1 , CD2 

TIB : ILE 100 :N, CA, C, O, CB, CGI , CG2 , CD1 

TIB : ASN 101 :N, CA, C, O, CB , CG, 0D1, ND2 

TIB: ASP 102 :N, CA, C, O, CB, CG, 0D1, 0D2 
30 TIB : ILE 103 :N, CA, C, O, CB, CGI , CG2 , CD1 

TIB:CYS 104:N,CA,C,O,CB,SG 

TIB : SER 105 :N, CA, C, O, CB , OG 

TIB : GLY 106:N,CA,C,O 

TIB:CYS 107:N,CA,C,O,CB,SG 
35 TIB : SER 116 :N, CA, C, O, CB , OG 

TIB : TRP 117 :N, CA, C, O, CB , CG, CD1 , CD2 , NE1 , CE2 , CE3 , CZ2 , CZ3 , CH2 

TIB : SER 119 :N, CA, C , O, CB , OG 

TIB : VAL 120 :N, CA, C, 0, CB, CGI , CG2 

TIB : ALA 121 :N, CA, C, O, CB 
40 TIB : VAL 132 :N, CA, C, O, CB , CGI , CG2 

TIB : ARG 133 :N, CA, C,0, CB, CG, CD,NE, CZ,NH1 / NH2 

TIB:GLU 134 :N, CA, C, O, CB , CG, CD, OE1 , OE2 

TIB:PRO 136^^,00,0,0^,00 

TIB: ARG 139 :N, CA, C , O, CB , CG, CD , NE , CZ , NH1 , NH2 
45 TIB : VAL 140 :N, CA, C , O, CB , CGI , CG2 

TIB : VAL 141 :N, CA, C, 0, CB, CGI , CG2 

TIB : PHE 142 :N, CA, C, O, CB , CG, CD1 , CD 2 , CE1 , CE2 , CZ 

TIB : VAL 154 :N, CA, C, O, CB , CGI , CG2 

TIB : ALA 155 :N, CA, C, O, CB 
50 TIB: GLY 156:N,CA,C,0 

TIB : ALA 157 : N, CA, C , 0 , CB 

TIB: ASP 158 :N, CA, C, O, CB , CG, ODl , OD2 

TIB: LEU 159 :N, CA, C, O, CB , CG, CD1 , CD2 

TIB: ARG 160 :N, CA, C , O, CB , CG, CD , NE , CZ , NH1 , NH2 
55 TIB :GLY 161:N,CA,C,0 

TIB: ASN 162 :N, CA, C, O, CB , CG, ODl , ND2 

TIB :GLY 163:N,CA,C,0 

TIB : TYR 164 :N, CA, C, O, CB , CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 
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TIB: ASP 165 :N, CA, 0,0,06, CG, 0D1 , 0D2 

TIB : ILE 166:N,CA,C,0,CB, CGI , CG2 , CD 1 

TIB: ASP 167 :N, OA, C, O, CB, CG, 0D1 , 0D2 

TIB : VAL 168 : N, OA, C , O, CB , CGI , CG2 
5 TIB : PHE 169 :N, OA, C, 0, CB, CG, CD1, CD2 , CE1 , CE2 , CZ 

TIB :GLY 177:N,CA,C,0 

TIB : ASN 178 : N, CA, C, 0, CB , CG, 0D1 , ND2 

TIB : ARG 179 : N, CA, C, 0, CB , CG, CD, NE, CZ , NH1 , NH2 

TIB : ALA 180 :N, CA, C, O, CB 
10 TIB : PHE 181 : N, CA, C, O, CB , CG, CD1 , CD2 , CE1 , CE2 , CZ 

TIB : ALA 182 :N, CA, C, 0, CB 

TIB :GLU 183 :N, CA, C, O, CB, CG, CD, 0E1, 0E2 

TIB : PHE 184 :N, CA, C, O, CB, CG, CD1 , CD2 , CE1 , CE2 , CZ 

TIB: LEU 185 : N, CA, C, 0, CB , CG, CD1 , CD2 
15 TIB : VAL 187 : N, CA, C, 0, CB , CGI , CG2 

TIB :THR 189 :N, CA, C, 0, CB, 0G1 , CG2 

TIB : GLY 190:N,CA,C,O 

TIB : GLY 191:N,CA,C,0 

TIB:PRO 207:N,CA / CD,C / O,CB,CG 
20 TIB:PRO 208 :N, CA, CD, C, 0, CB, CG 

TIB: ARG 209 : N, CA, C, O, CB , CG, CD, NE, CZ , NH1 , NH2 

TIB :GLU 210 : N, CA, C, 0, CB , CG, CD, 0E1 , 0E2 

TIB: PHE 2 1 1 : N , CA , C , 0 , CB , CG , CD 1 , CD2 , CE 1 , CE 2 , CZ 

TIB: GLY 212:N,CA,C,0 
25 TIB : SER 214 : N, CA, C, 0, CB , OG 

TIB: HIS 215 :N, CA, C, 0, CB , CG, ND1 , CD2 , CE1 , NE2 

TIB : SER 216 :N f CA, C, O, CB , OG 

TIB : GLY 225:N,CA,C,0 

TIB: LEU 227 : N, CA, C, O, CB , CG, CD1 , CD2 
30 TIB : VAL 228 :N, CA, C, 0, CB, CGI , CG2 

TIB: PRO 229:N,CA,CD,C,0,CB,CG 

TIB : ILE 241 :N, CA, C, O, CB , CGI , CG2 , CD1 

TIB: ASP 242 : N, CA, C, 0, CB , CG, 0D1 , 0D2 

TIB : ALA 243 :N, CA, C, 0, CB 
35 TIB : THR 244 :N, CA, C, O, CB, 0G1 , CG2 

TIB: PRO 250 :N, CA, CD, C, 0, CB , CG 

TIB : PHE 262 : N, CA, C, O, CB, CG, CD1 , CD2 , CE1 , CE2 , CZ 

TIB:CYS 268 :N, CA, C, O, CB, SG 
Subset SUB5B: 
4 0 subSmole . list 

Subset SUB5B: 

TIB: 3 -4, 6-7, 10-12, 15,22-23,25-3 0,35,37,39,41-42,44-47, 51-55, 67, 
69-70, 

TIB : 72, 74 -75, 94 -99, 108 -112 ,114-115, 118 ,122 -126, 128-131, 135, 137- 

45 138, 

TIB: 186, 188, 192 -195, 213,217-219,223-224,230-231,234-235,238-240, 
TIB:245,269 

subSbat om .list 
Subset SUB5B: 
50 TIB : SER 3 : N, CA, C, 0, CB , OG 

TIB :GLN 4 : N, CA, C, 0, CB, CG, CD, OE1 , NE2 
TIB: LEU 6 :N, CA, C, 0 , CB , CG, CD1 , CD2 
TIB: PHE 7 : N , CA , C , O , CB , CG , CD1 , CD2 , CE 1 , CE2 , CZ 
TIB : PHE 10 : N, CA, C, O, CB, CG, CD1 , CD 2 , CE1 , CE2 , CZ 
55 TIB : ASN 11 :N, CA, C, O, CB, CG, OD1 , ND2 

TIB: LEU 12 :N, CA, C, 0, CB , CG, CD1 , CD2 
TIB :GLN 15 :N, CA, C, O, CB, CG, CD, OE1 , NE2 
TIB:CYS 22 : N, CA, C, O, CB , SG 
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TIB :GLY 23 :N,CA,C,0 

TIB : ASN 25 :N, CA, C, 0, CB , CG, 0D1 , ND2 

TIB : ASN 26 :N, CA, C, 0, CB , CG, 0D1 , ND2 

TIB: ASP 27 :N, CA, C, O, CB, CG, 0D1 , 0D2 
5 TIB : ALA 28 :N, CA, C, O, CB 

TIB: PRO 29:N,CA,CD,C,0,CB,CG 

TIB : ALA 30 :N, CA, C, 0, CB 

TIB : THR 35 :N, CA, C, 0, CB , OG1 , CG2 

TIB : THR 37 :N, CA, C, O, CB, OG1 , CG2 
10 TIB : ASN 39 :N, CA, C, O, CB, CG, OD1 ,ND2 

TIB:CYS 41:N,CA,C,0,CB,SG 

TIB: PRO 42 : N, CA, CD, C, O, CB , CG 

TIB : VAL 44 :N, CA, C, 0, CB , CGI , CG2 

TIB :GLU 45 :N, CA, C, 0, CB , CG, CD, OE1 , OE2 
15 TIB :LYS 46 :N, CA, C, 0, CB , CG, CD, CE, NZ 

TIB : ALA 47 :N, CA, C, 0, CB 

TIB : PHE 51 :N, CA, C, O, CB, CG, CD1 , CD 2 , CE1 , CE2 , CZ 
TIB: LEU 52 : N, CA, C, O, CB, CG, CD1 , CD2 

TIB :TYR 53 : N , CA, C , 0 , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 
20 TIB : SER 54 :N, CA, C, 0, CB , OG 

TIB : PHE 55 :N, CA, C, 0, CB , CG, CD1 , CD2 , CE1 , CE2 , CZ 

TIB: LEU 67 : N, CA, C, O, CB, CG, CD1 , CD2 

TIB: LEU 69 : N, CA, C, O, CB, CG, CD1 , CD2 

TIB: ASP 70 : N, CA, C, 0, CB , CG, OD1 , OD2 
25 TIB : THR 72 : N, CA, C, 0 , CB , 0G1 , CG2 

TIB : LYS 74 :N, CA, C, 0, CB, CG, CD, CE, NZ 

TIB : LEU 75 :N, CA, C, 0, CB , CG, CD1 , CD2 

TIB : ASN 94 :N, CA, C, 0, CB, CG, 0D1 , ND2 

TIB : PHE 95 : N, CA, C , 0 , CB , CG, CD1 , CD 2 , CE1 , CE2 , CZ 
30 TIB:ASP 96 : N, CA, C, 0 , CB , CG, 0D1 , 0D2 

TIB : LEU 97 :N, CA, C, 0, CB , CG, CD1 , CD 2 

TIB: LYS 9 8 : N , CA , C , 0 , CB , CG , CD , CE , NZ 

TIB :GLU 99 :N, CA, C, O, CB , CG, CD, 0E1 , 0E2 

TIB : ARG 108 : N, CA, C, O, CB , CG, CD , NE , CZ , NH1 , NH2 
35 TIB :GLY 109:N,CA,C,O 

TIB : HIS 110 :N, CA, C, O, CB, CG, ND1 , CD2 , CE1 , NE2 

TIB: ASP lll:N,CA,C,0,CB,CG / ODl,OD2 

TIB :GLY 112:N,CA,C,0 

TIB :THR 114 :N, CA, C, O, CB, OG1 , CG2 
40 TIB : SER 115 :N, CA, C, O, CB, OG 

TIB: ARG 118 :N, CA, C, 0, CB, CG, CD,NE, CZ,NH1, NH2 

TIB : ASP 122 : N , CA, C , O , CB , CG, OD1 , OD2 

TIB : THR 123 :N, CA, C, O, CB, OG1 , CG2 

TIB: LEU 124 :N, CA, C , O, CB , CG, CD1 , CD2 
45 TIB : ARG 125 :N, CA, C, O, CB, CG, CD , NE, CZ , NH1 , NH2 

TIB: GLN 1 2 6 : N , CA , C , 0 , CB , CG , CD , OE 1 , NE2 

TIB : VAL 128 :N, CA, C, 0, CB, CGI , CG2 

TIB: GLU 1 2 9 : N , CA , C , O , CB , CG , CD , OE 1 , OE2 

TIB: AS P 130:N,CA,C,O,CB,CG,ODl,OD2 
50 TIB : ALA 131 :N, CA, C , O, CB 

TIB: HIS 135 :N, CA, C, 0, CB , CG, ND1 , CD2 , CE1 , NE2 

TIB : ASP 137 :N, CA, C, 0, CB , CG, ODl , OD2 

TIB : TYR 13 8 : N, CA, C , O , CB , CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 
TIB : THR 186 :N, CA, C, O, CB, OG1 , CG2 
55 TIB : GLN 188 :N, CA, C, O, CB, CG, CD, 0E1 , NE2 

TIB : THR 192 : N, CA, C , 0 , CB , OG1 , CG2 
TIB: LEU 193 :N, CA, C, 0, CB, CG, CD1, CD2 
TIB : TYR 1 94 : N, CA, C , O , CB , CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 
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TIB : ARG 195 : N, CA, C, O , CB , CG, CD, NE , CZ , NH1 , NH2 
TIB : TYR 2 13 : N, CA, C , O , CB , CG, CD1 , CD 2 , CE1 , CE2 , CZ , OH 
TIB : SER 217 :N, CA, C, O, CB, OG 
TIB:PRO 218:N,CA,CD,C,0,CB,CG 
5 TIB:GLU 219:N / CA,C,0 / CB,CG,CD,OEl,OE2 

TIB : LYS 223 : N, CA, C, 0, CB , CG, CD, CE , NZ 
TIB : SER 224 :N, CA, C, 0, CB, OG 
TIB : VAL 23 0 : N, CA, C , 0 , CB , CGI , CG2 
TIB :THR 23 1 : N, CA, C, 0 , CB , 0G1 , CG2 
10 TIB:ASP 234:N,CA,C,0,CB,CG,ODl,OD2 

TIB : ILE 235 : N, CA, C, O, CB , CGI , CG2 , CD1 
TIB : ILE 2 3 8 : N, CA , C , O , CB , CGI , CG2 , CD1 
TIB :GLU 239 :N, CA, C, O, CB, CG, CD, 0E1 , 0E2 
TIB :GLY 240:N,CA,C,O 
15 TIB :GLY 245:N,CA,C,0 

TIB: LEU 269 : N, CA, C, O, CB, OXT, CG, CD1 , CD2 
Subset ACTSITE: 

actsitemole . list 
Subset ACTSITE: 

2 0 TIB: 17 ,21, 80-87 ,89-90, 113, 143 -153, 170-176, 196-206, 221-222, 226, 

246-249, 

TIB:251-261, 263-267 
actsiteatom. list 
Subset ACTSITE: 
25 TIB : SER 17 : N, CA, C , O, CB , OG 

TIB : TYR 2 1 : N, CA, C , O , CB , CG, CD1 , CD 2 , CE1 , CE2 , CZ , OH 

TIB : PHE 8 0 : N, CA, C , O , CB , CG, CD1 , CD2 , CE1 , CE2 , CZ 

TIB: ARG 81 :N, CA, C, O, CB, CG, CD, NE, CZ, NH1 , NH2 

TIB :GLY 82:N,CA,C,0 
30 TIB : SER 83 : N, CA, C, O, CB , OG 

TIB: ARG 84 : N, CA, C, O, CB, CG, CD, NE, CZ , NH1 , NH2 

TIB : SER 85 : N, CA, C , O, CB , OG 

TIB : ILE 86 : N, CA, C , O , CB , CGI , CG2 , CD1 

TIB :GLU 87 : N, CA, C, O, CB, CG, CD, OE1 , OE2 
3 5 TIB : TRP 8 9 : N, CA, C , 0 , CB , CG, CD1 , CD 2 , NE1 , CE2 , CE3 , CZ2 , CZ3 , CH2 

TIB : ILE 90 : N, CA, C, O, CB, CGI , CG2 , CD1 

TIB : PHE 113 : N, CA, C, 0 , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ 

TIB :THR 143 :N, CA, C, 0, CB, OG1 , CG2 

TIB :GLY 144:N,CA,C,0 
40 TIB:HIS 145 : N, CA, C , 0 , CB , CG, ND1 , CD2 , CE1 , NE2 

TIB : SER 146 : N, CA, C, 0, CB, OG 

TIB : LEU 14 7 : N, CA, C , 0 , CB , CG , CD1 , CD 2 

TIB :GLY 148:N,CA,C,0 

TIB :GLY 149:N,CA,C,0 
45 TIB : ALA 150 : N, CA, C, 0, CB 

TIB: LEU 151 :N, CA, C, 0, CB, CG, CD1 , CD2 

TIB : ALA 152 : N, CA, C, 0, CB 

TIB : THR 153 : N, CA, C, 0, CB , OG1 , CG2 

TIB : SER 170 : N, CA, C, 0, CB, OG 
5 0 TIB : TYR 17 1 : N , CA, C , 0 , CB , CG , CD1 , CD2 , CE1 , CE2 , CZ , OH 

TIB : GLY 172:N,CA,C,0 

TIB : ALA 173 : N, CA, C, 0, CB 

TIB:PRO 174 : N, CA, CD, C, O, CB , CG 

TIB: ARG 175 : N, CA, C, 0, CB, CG, CD, NE, CZ , NH1 , NH2 
55 TIB : VAL 176 :N, CA, C, 0, CB, CGI , CG2 

TIB: ILE 196:N,CA,C,0,CB,CG1,CG2,CD1 

TIB: THR 197 : N, CA, C, 0, CB, OG1 , CG2 

TIB: HIS 198:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 
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TIB : THR 199 : N, CA, C, O, CB , 0G1 , CG2 
TIB : ASN 200 : N, CA, C, O, CB , CG, OD1 , ND2 
TIB: ASP 201 :N,CA,C, 0,06,00,001,002 
TIB : ILE 202:N,CA,C,O, CB, CGI , CG2 , CD1 
5 TIB : VAL 203 :N, CA, C, O, CB, CGI , CG2 

TIB:PRO 204 :N,CA,CD,C,0,CB,CG 
TIB : ARG 205 :N, CA, C, O, CB, CG, CD, NE, CZ , NH1 , NH2 
TIB : LEU 2 0 6 : N , CA , 0 , O , CB , CG , CD1 , CD2 

TIB : TRP 221 : N, CA, C, O, CB , CG, CD1 , CD2 , NE1 , CE2 , CE3 , CZ2 , CZ3 , CH2 
10 TIB : ILE 222 : N, CA, C, O, CB , CGI , CG2 , CD1 

TIB : THR 226 : N, CA, C, O, CB , OG1 , CG2 

TIB :GLY 246:N,CA,C,0 

TIB: ASN 247 : N, CA, C, O, CB , CG, OD1 , ND2 

TIB: ASN 248 : N, CA, C, O, CB , CG, OD1 , ND2 
15 TIB : GLN 249 :N, CA, C, O, CB , CG, CD, OE1 , NE2 

TIB: ASN 251 : N, CA, C, O, CB , CG, OD1 , ND2 

TIB : ILE 252 : N, CA, C, O, CB, CGI , CG2 , CD1 

TIB:PRO 253 : N, CA, CD , C , O, CB, CG 

TIB: ASP 254:N,CA,C,0,CB,CG,ODl,OD2 
20 TIB : ILE 255 : N, CA, C, O, CB , CGI , CG2 , CD1 

TIB:PRO 256:N,CA,CD / C,0,CB,CG 

TIB : ALA 257 : N, CA, C, O, CB 

TIB: HIS 258:N,CA,C,0,CB,CG,ND1 / CD2,CE1,NE2 
TIB : LEU 259 : N, CA, C, O, CB, CG, CD1 , CD2 
2 5 TIB : TRP 260 : N, CA, C, O, CB , CG, CD1 , CD2 , NE1 , CE2 , CE3 , CZ2 , CZ3 , CH2 

TIB : TYR 261 : N, CA, C, O, CB , CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 
TIB :GLY 263:N,CA,C,0 
TIB: LEU 264 : N, CA, C, O, CB , CG, CD1 , CD2 
TIB : ILE 265 : N, CA, C, O, CB , CGI , CG2 , CD1 
30 TIB :GLY 266:N,CA,C,0 

TIB : THR 267 : N, CA, C, O, CB , OG1 , CG2 
Subset RESTX: 

restxmole . list 
Subset RESTX: 

3 5 NEWMODEL : 14 ,16, 18 -20, 3 1-34, 36, 38, 4 0,48-50, 56- 66 ,68, 78-79 ,88, 91- 

93, 

NEWMODEL: 104 -106, 120, 136,225,227-229,250,262,268 
restxatom. list 
Subset RESTX: 

4 0 NEWMODEL: ALA 14 : N, CA, C, O, CB 

NEWMODEL : TYR 16 : N, CA, C, O, CB , CG, CD1 , CD2 , CE1 , CE2 , CZ , OH 

NEWMODEL: ALA 18 : N, CA, C, O, CB 

NEWMODEL: ALA 19 : N, CA, C, O, CB 

NEWMODEL : ALA 20 : N, CA, C, 0 , CB 
45 NEWMODEL : GLY 31:N,CA,C,0 

NEWMODEL : THR 32 : N, CA, C, O, CB , OG1 , CG2 

NEWMODEL :ASN 33 : N, CA, C, O, CB , CG, OD1 , ND2 

NEWMODEL : ILE 34 : N, CA, C, O, CB , CGI , CG2 , CD1 

NEWMODEL : CYS 3 6 : N, CA, C , O, CB , SG 
50 NEWMODEL :GLY 38:N,CA,C,0 

NEWMODEL: ALA 40 : N, CA, C, O, CB 

NEWMODEL: ASP 48 : N, CA, C, 0, CB, CG, OD1 , OD2 

NEWMODEL: ALA 49 : N, CA, C, O, CB 

NEWMODEL : THR 50 :N, CA, C, O, CB, OG1 , CG2 
55 NEWMODEL :GLU 56 : N, CA, C, O, CB , CG, CD, OE1 , OE2 

NEWMODEL : ASP 57 :N, CA, C, O, CB, CG, OD1 , OD2 

NEWMODEL : SER 58 :N, CA, C, O, CB, OG 

NEWMODEL : GLY 59:N,CA,C,0 
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NEWMODEL : VAL 60 :N, CA, C, O, CB, CGI , CG2 
NEWMODEL : GLY 61:N,CA,C,0 
NEWMODEL : ASP 62 :N, CA, C, O, CB, CG, OD1 , OD2 
NEWMODEL: VAL 63 :N, CA, C, O, CB, CGI , CG2 
5 NEWMODEL : THR 64 :N, CA, C, O, CB, OG1 , CG2 

NEWMODEL :GLY 65:N,CA,C,0 

NEWMODEL : PHE 66 :N, CA, C, O, CB, CG, CD1 , CD 2 , CE1 , CE2 , CZ 

NEWMODEL : ALA 68 :N, CA, C, O, CB 

NEWMODEL : LEU 78 : N, CA, C , 0 , CB , CG , CD1 , CD2 
10 NEWMODEL : SER 79 :N, CA, C, O, CB, OG 

NEWMODEL : ASN 8 8 : N , CA , C , O , CB , CG , OD 1 , ND2 

NEWMODEL :GLY 91:N,CA,C,0 

NEWMODEL : ASN 92:N,CA,C,0,CB,CG,0D1,ND2 

NEWMODEL : LEU 93 :N, CA, C, 0, CB, CG, CD1 , CD2 
15 NEWMODEL : CYS 104 :N, CA, C, O, CB , SG 

NEWMODEL: SER 105 :N, CA, C, O, CB, OG 

NEWMODEL: GLY 106:N,CA,C,O 

NEWMODEL : VAL 120 :N, CA, C, O, CB, CGI , CG2 

NEWMODEL : PRO 1 3 6 : N , CA , CD , C , O , CB , CG 
2 0 NEWMODEL: GLY 225:N,CA,C,0 

NEWMODEL : LEU 2 2 7 : N , CA , C , O , CB , CG , CD 1 , CD2 

NEWMODEL : VAL 2 2 8 : N , CA , C , O , CB , CGI , CG2 

NEWMODEL : PRO 2 2 9 : N , CA , CD , C , O , CB , CG 

NEWMODEL : PRO 2 5 0 : N , CA , CD , C , O , CB , CG 
2 5 NEWMODEL : PHE 262 : N, CA, C, O, CB , CG, CD1 , CD 2 , CE1 , CE2 , CZ 

NEWMODEL: CYS 268 : N, CA, C , O, CB , SG 



Example 11 

Providing a lipase variant E87K+D254K 
30 The Humicola lanuginosa lipase variant E87K+D254K was 

constructed, expressed and purified as described in WO 92/05249. 



Example 12 

Lipase-S-PEG 15,000 conjugate 
35 The lipase variant E87K+D254K-SPEG conjugate was prepared as 

described in Example 7, except that the enzyme is the Humicola 
lanuginosa lipase variant (E87K+D2 54K) described in Example 11 and the 
polymer is mPEG 15,000. 



40 Example 13 

Immunogenicity assessed as IgG i of lipase variant (D87K+D254K) in Balb/C 
mice 

Balb/c mice were immunized by subcutaneous injection of: 

(i) 50 microliters 0.9% (wt/vol) NaCl solution (control group, 8 
4 5 mice) (control) , 

(ii) 50 microliters 0.9% (wt/vol) NaCl solution containing 25 
micrograms of protein of a Humicola lanuginosa lipase variant 
(E87K+D254K) (group 1, 8 mice) (unmodified lipase variant), 
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(iii) 50% 0.9% (wt/vol) NaCl solution containing a Humicola 
lanugoinosa lipase variant substituted in position D87K+D254K and 
coupled to an N-succinimidyl carbonate activated mPEG 15, 000 (group 2, 8 
mice) (lipase-SPEG 15,000). 
5 The amount of protein for each batch was measured by optical 

density measurements. Blood samples (200 microliters) were collected 
from the eyes one week after the immunization, but before the 
following immunization. Serum was obtained by blood clotting, and 
cent r i f ugat ion . 

10 The IgGi response was determined by use of the Balb/C mice IgGi 

ELISA method as described above. 



Results : 

Five weekly immunizations were required to elicit a detectable 
15 humoral response to the unmodified Humicola lanuginosa variant. The 
antibody titers elicited by the conjugate (i.e. lipase-SPEG15, 000 
ranged between 960 and 1920, and were only 2 to 4x lower than the 
antibody titer of 3840 that was elicited by unmodified HL82-LIPOLASE 
(figure to the left) . 
2 0 The results of the tests are shown in Figure 1. 

As will be apparent to those skilled in the art, in the light of 
the foregoing disclosure, many alterations and modifications are 
possible in the practice of this invention without departing from the 
spirit or scope thereof. Accordingly, the scope of the invention is to 
25 be construed in accordance with the substance defined by the following 
claims . 
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